Causal reasoning in typical computer vision tasks
Kexuan Zhang, Qiyu Sun, Chaoqiang Zhao, Yang Tang

TL;DR
This paper reviews how causal reasoning can improve computer vision tasks by addressing the limitations of correlation-based deep learning, highlighting recent methods and future directions.
Contribution
It provides a comprehensive review of causal methods in vision and vision-language tasks, summarizing their advantages and proposing future research roadmaps.
Findings
Causal methods help avoid spurious correlations in vision tasks.
Causal paradigms enhance robustness and interpretability of models.
Future work includes extending causality to complex scenes and systems.
Abstract
Deep learning has revolutionized the field of artificial intelligence. Based on the statistical correlations uncovered by deep learning-based methods, computer vision has contributed to tremendous growth in areas like autonomous driving and robotics. Despite being the basis of deep learning, such correlation is not stable and is susceptible to uncontrolled factors. In the absence of the guidance of prior knowledge, statistical correlations can easily turn into spurious correlations and cause confounders. As a result, researchers are now trying to enhance deep learning methods with causal theory. Causal theory models the intrinsic causal structure unaffected by data bias and is effective in avoiding spurious correlations. This paper aims to comprehensively review the existing causal methods in typical vision and vision-language tasks such as semantic segmentation, object detection, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Domain Adaptation and Few-Shot Learning · Explainable Artificial Intelligence (XAI)
