Causally-Guided Automated Feature Engineering with Multi-Agent Reinforcement Learning
Arun Vignesh Malarkkan, Wangyang Ying, Yanjie Fu

TL;DR
This paper introduces CAFE, a causally-guided multi-agent reinforcement learning framework for automated feature engineering that improves robustness and efficiency by leveraging causal discovery.
Contribution
CAFE reformulates automated feature engineering as a causally-guided sequential decision process using multi-agent reinforcement learning, enhancing robustness and interpretability.
Findings
Up to 7% improvement over strong AFE baselines.
Reduces episodes-to-convergence and improves time-to-target.
Decreases performance drop under covariate shifts by ~4x.
Abstract
Automated feature engineering (AFE) enables AI systems to autonomously construct high-utility representations from raw tabular data. However, existing AFE methods rely on statistical heuristics, yielding brittle features that fail under distribution shift. We introduce CAFE, a framework that reformulates AFE as a causally-guided sequential decision process, bridging causal discovery with reinforcement learning-driven feature construction. Phase I learns a sparse directed acyclic graph over features and the target to obtain soft causal priors, grouping features as direct, indirect, or other based on their causal influence with respect to the target. Phase II uses a cascading multi-agent deep Q-learning architecture to select causal groups and transformation operators, with hierarchical reward shaping and causal group-level exploration strategies that favor causally plausible…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Advanced Causal Inference Techniques · Bayesian Modeling and Causal Inference
