A Recipe for Creating Multimodal Aligned Datasets for Sequential Tasks

Publication
Association for Computational Linguistics
