Towards Causal Relationship in Indefinite Data: Baseline Model and New Datasets
Hang Chen, Xinyu Yang, Keqing Du

TL;DR
This paper introduces new datasets and a probabilistic baseline model to address the challenges of learning causal structures from complex, indefinite data like dialogue and video, which contain multiple structures and values.
Contribution
The paper releases two high-quality datasets, Causalogue and Causaction, and proposes a novel probabilistic framework to handle multi-structure, multi-value indefinite data for causal discovery.
Findings
Datasets enable causal analysis in dialogue and video data.
The probabilistic model effectively captures causal structures and representations.
Experimental results demonstrate the model's ability to disentangle causal effects and confounders.
Abstract
Integrating deep learning and causal discovery has encouraged us to spot that learning causal structures and representations in dialogue and video is full of challenges. We defined These data forms as "Indefinite Data", characterized by multi-structure data and multi-value representations. Unlike existing adaptable data forms, Indefinite Data still faces gaps in datasets and methods. To address the dataset gap, we release two high-quality datasets - Causalogue and Causaction, containing text dialogue samples and video action samples with causal annotations respectively. Moreover, the method gap arises from the coexistence of multi-structure data and multi-value representations, breaking the assumptions of all current methods and rendering them infeasible on Indefinite Data. To this end, we propose a probabilistic framework as a baseline, incorporating three designed highlights for this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Graph Neural Networks · Topic Modeling
