Learning Causality for Modern Machine Learning

Yongqiang Chen

arXiv:2506.12226·cs.LG·June 17, 2025

Learning Causality for Modern Machine Learning

Yongqiang Chen

PDF

Open Access

TL;DR

This paper explores how incorporating causal inference principles, specifically the invariance of causal mechanisms, can improve out-of-distribution generalization, interpretability, and robustness in modern machine learning models.

Contribution

It introduces a framework leveraging the invariance of causal mechanisms to enhance OOD generalization, interpretability, and robustness, especially applied to graph-structured data.

Findings

01

Causal invariance improves OOD generalization.

02

Learning causality enhances model interpretability.

03

Causal approaches increase robustness to adversarial attacks.

Abstract

In the past decades, machine learning with Empirical Risk Minimization (ERM) has demonstrated great capability in learning and exploiting the statistical patterns from data, or even surpassing humans. Despite the success, ERM avoids the modeling of causality the way of understanding and handling changes, which is fundamental to human intelligence. When deploying models beyond the training environment, distribution shifts are everywhere. For example, an autopilot system often needs to deal with new weather conditions that have not been seen during training, An Al-aided drug discovery system needs to predict the biochemical properties of molecules with respect to new viruses such as COVID-19. It renders the problem of Out-of-Distribution (OOD) generalization challenging to conventional machine learning. In this thesis, we investigate how to incorporate and realize the causality for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications