TraVLR: Now You See It, Now You Don't! A Bimodal Dataset for Evaluating Visio-Linguistic Reasoning
Keng Ji Chow, Samson Tan, Min-Yen Kan

TL;DR
TraVLR introduces a synthetic visio-linguistic dataset with multiple reasoning tasks, designed to evaluate models' ability to understand and transfer concepts across visual and linguistic modalities, especially in out-of-distribution scenarios.
Contribution
The paper presents TraVLR, a novel synthetic dataset with multiple reasoning tasks and evaluation settings, enabling detailed analysis of visio-linguistic models' cross-modal transfer capabilities.
Findings
State-of-the-art models perform well within the same modality.
Models struggle with cross-modal transfer tasks.
Adding or removing modalities limits model success.
Abstract
Numerous visio-linguistic (V+L) representation learning methods have been developed, yet existing datasets do not adequately evaluate the extent to which they represent visual and linguistic concepts in a unified space. We propose several novel evaluation settings for V+L models, including cross-modal transfer. Furthermore, existing V+L benchmarks often report global accuracy scores on the entire dataset, making it difficult to pinpoint the specific reasoning tasks that models fail and succeed at. We present TraVLR, a synthetic dataset comprising four V+L reasoning tasks. TraVLR's synthetic nature allows us to constrain its training and testing distributions along task-relevant dimensions, enabling the evaluation of out-of-distribution generalisation. Each example in TraVLR redundantly encodes the scene in two modalities, allowing either to be dropped or added during training or testing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
MethodsBest Ways to Get in Touch with Expedia When You’re Dealing with Travel Issues or Booking Errors · 30 Ways Contact Expedia Customer Service Using Phone, Email, Chat, and App-Based Support Options
