CausalAffect: Causal Discovery for Facial Affective Understanding
Guanyu Hu, Tangzheng Lian, Dimitrios Kollias, Oya Celiktutan, Xinyu Yang

TL;DR
CausalAffect is a novel framework that discovers causal relationships among facial Action Units and expressions, improving affect recognition by integrating causal inference with facial behavior analysis.
Contribution
It introduces the first causal graph discovery method for facial affect analysis that does not require annotated causal data or priors, aligning with psychological theories.
Findings
Achieves state-of-the-art AU detection and expression recognition performance.
Recovers causal structures consistent with psychological theories.
Reveals novel inhibitory and uncharacterized dependencies.
Abstract
Understanding human affect from facial behavior requires not only accurate recognition but also structured reasoning over the latent dependencies that drive muscle activations and their expressive outcomes. Although Action Units (AUs) have long served as the foundation of affective computing, existing approaches rarely address how to infer psychologically plausible causal relations between AUs and expressions directly from data. We propose CausalAffect, the first framework for causal graph discovery in facial affect analysis. CausalAffect models AU-AU and AU-Expression dependencies through a two-level polarity and direction aware causal hierarchy that integrates population-level regularities with sample-adaptive structures. A feature-level counterfactual intervention mechanism further enforces true causal effects while suppressing spurious correlations. Crucially, our approach requires…
Peer Reviews
Decision·Submitted to ICLR 2026
* The trained models and code will be released, which is a significant contribution to the research community. * Extensive experiments, including ablation studies, are conducted on multiple datasets. The paper compares the method against competitive baselines and demonstrates promising performance. * Overall, the paper is well-written and presented in good format. * Interesting analysis and visualizations are presented in Section 4.2. These provide readers with a straightforward understanding of
* Figure 2 is cited in line 46 but is located on page 8, which is too far from the relevant text and disrupts the flow of reading. * The paper lacks a Related Work section. * While Table 1 shows promising performance, the comparison feels unfair because the best CausalAffect configuration utilizes additional training data sources. Without this extra data, the performance is very close to the best baseline, and a significant statistical test is missing to confirm the difference.
The primary strength lies in its novel formulation of facial affective analysis as a causal discovery problem, moving beyond mere correlation to seek psychologically plausible mechanisms. The proposed framework, CausalAffect, integrates a two-level (global and sample-adaptive) graph structure to capture both stable population-level rules and context-specific dynamics. Its ability to model both excitatory and inhibitory relations, combined with an efficient feature-level counterfactual interventi
- The system complexity would be a concern to me. CausalAffect contains four different modules and that caused a large number of loss functions. Their corresponding hyperparameters (e.g., $\lambda_{ib}$, $\lambda_{DAG}$, $\lambda_{consist}$) also **increase the risk of training instability** and these factors would ** make it difficult for other researchers to reproduce the results**. - The paper employs a significant number of mathematical symbols and formulas, which is commendable. However,
The idea of trying to find a proper AU relationship as well as a relationship between AUs and expressions is appealing, despite not being new, and the authors try to approach it in a data-driven way. The results are compelling and the relation between AUs and expressions across datasets is investigated, showing similar correlations than that of existing work.
The paper is poorly written, and poorly presented, with many broken sentences. The narrative is very loose and the figures and notation do not serve the understanding of the paper. The paper is full of clutter and the tables and figures have been minimized to fit in the paper to an unacceptable level. The method is not novel and combines many pieces of existing work. The discovery of knowledge-based AU graphs is not new, it has been presented in many works; as an example there are the following
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Face Recognition and Perception · Mental Health Research Topics
