Active Causal Experimentalist (ACE): Learning Intervention Strategies via Direct Preference Optimization

Patrick Cooper; Alvaro Velasquez

arXiv:2602.02451·cs.LG·February 3, 2026

Active Causal Experimentalist (ACE): Learning Intervention Strategies via Direct Preference Optimization

Patrick Cooper, Alvaro Velasquez

PDF

Open Access

TL;DR

ACE is a novel method that learns adaptive intervention strategies for causal discovery by optimizing pairwise preferences, outperforming traditional methods and discovering principled strategies through experience.

Contribution

It introduces Direct Preference Optimization for learning causal intervention policies from pairwise comparisons, enabling adaptive and principled experimental design.

Findings

01

Achieves 70-71% improvement over baselines at equal intervention budgets

02

Learns to target collider mechanisms with concentrated interventions

03

Demonstrates effectiveness across synthetic, physics, and economic datasets

Abstract

Discovering causal relationships requires controlled experiments, but experimentalists face a sequential decision problem: each intervention reveals information that should inform what to try next. Traditional approaches such as random sampling, greedy information maximization, and round-robin coverage treat each decision in isolation, unable to learn adaptive strategies from experience. We propose Active Causal Experimentalist (ACE), which learns experimental design as a sequential policy. Our key insight is that while absolute information gains diminish as knowledge accumulates (making value-based RL unstable), relative comparisons between candidate interventions remain meaningful throughout. ACE exploits this via Direct Preference Optimization, learning from pairwise intervention comparisons rather than non-stationary reward magnitudes. Across synthetic benchmarks, physics…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Causal Inference Techniques · Gaussian Processes and Bayesian Inference · Philosophy and History of Science