CausalChaos! Dataset for Comprehensive Causal Action Question Answering   Over Longer Causal Chains Grounded in Dynamic Visual Scenes

Paritosh Parmar; Eric Peh; Ruirui Chen; Ting En Lam; Yuhan Chen,; Elston Tan; Basura Fernando

arXiv:2404.01299·cs.CV·June 17, 2024·2 cites

CausalChaos! Dataset for Comprehensive Causal Action Question Answering Over Longer Causal Chains Grounded in Dynamic Visual Scenes

Paritosh Parmar, Eric Peh, Ruirui Chen, Ting En Lam, Yuhan Chen,, Elston Tan, Basura Fernando

PDF

Open Access 1 Repo 1 Video

TL;DR

CausalChaos! is a new challenging dataset based on cartoons that enables research on complex causal reasoning in visual question answering, emphasizing multi-level causal chains and explanations in dynamic scenes.

Contribution

We introduce CausalChaos!, a novel dataset for causal video question answering with multi-level causal questions, grounded in animated scenes, and include hard incorrect answer mining to challenge models.

Findings

01

Models perform reasonably but need improvement on open-ended answers.

02

The dataset reveals the need for advanced causal relationship modeling.

03

Grounded in cartoon scenes, it offers clear causal signals for reasoning.

Abstract

Causal video question answering (QA) has garnered increasing interest, yet existing datasets often lack depth in causal reasoning. To address this gap, we capitalize on the unique properties of cartoons and construct CausalChaos!, a novel, challenging causal Why-QA dataset built upon the iconic "Tom and Jerry" cartoon series. Cartoons use the principles of animation that allow animators to create expressive, unambiguous causal relationships between events to form a coherent storyline. Utilizing these properties, along with thought-provoking questions and multi-level answers (answer and detailed causal explanation), our questions involve causal chains that interconnect multiple dynamic interactions between characters and visual scenes. These factors demand models to solve more challenging, yet well-defined causal relationships. We also introduce hard incorrect answer mining, including a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lunaproject22/causalchaos
pytorchOfficial

Videos

CausalChaos! Dataset for Comprehensive Causal Action Question Answering Over Longer Causal Chains Grounded in Dynamic Visual Scenes· slideslive

Taxonomy

TopicsMultimodal Machine Learning Applications · Human Pose and Action Recognition · Anomaly Detection Techniques and Applications

MethodsFocus