Learning Causality for Modern Machine Learning
Yongqiang Chen

TL;DR
This paper explores how incorporating causal inference principles, specifically the invariance of causal mechanisms, can improve out-of-distribution generalization, interpretability, and robustness in modern machine learning models.
Contribution
It introduces a framework leveraging the invariance of causal mechanisms to enhance OOD generalization, interpretability, and robustness, especially applied to graph-structured data.
Findings
Causal invariance improves OOD generalization.
Learning causality enhances model interpretability.
Causal approaches increase robustness to adversarial attacks.
Abstract
In the past decades, machine learning with Empirical Risk Minimization (ERM) has demonstrated great capability in learning and exploiting the statistical patterns from data, or even surpassing humans. Despite the success, ERM avoids the modeling of causality the way of understanding and handling changes, which is fundamental to human intelligence. When deploying models beyond the training environment, distribution shifts are everywhere. For example, an autopilot system often needs to deal with new weather conditions that have not been seen during training, An Al-aided drug discovery system needs to predict the biochemical properties of molecules with respect to new viruses such as COVID-19. It renders the problem of Out-of-Distribution (OOD) generalization challenging to conventional machine learning. In this thesis, we investigate how to incorporate and realize the causality for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
