Automated Adversarial Collaboration for Advancing Theory Building in the Cognitive Sciences

Suyog Chandramouli; George Kachergis; Akshay Jagadish

arXiv:2604.25521·cs.AI·April 29, 2026

Automated Adversarial Collaboration for Advancing Theory Building in the Cognitive Sciences

Suyog Chandramouli, George Kachergis, Akshay Jagadish

PDF

TL;DR

This paper presents an automated framework using AI and information theory to evaluate and compare cognitive science theories through a closed-loop simulation process.

Contribution

It introduces a novel automated adversarial collaboration system combining LLMs, program synthesis, and experimental design for theory adjudication.

Findings

01

Successfully identified the ground-truth theory in simulations

02

Recovered the correct theory across various noise levels

03

Demonstrated proof of concept for in-silico theory evaluation

Abstract

Cognitive science often evaluates theories through narrow paradigms and local model comparisons, limiting the integration of evidence across tasks and realizations. We introduce an automated adversarial collaboration framework for adjudicating among competing theories even when the candidate models and experiments must be discovered during the adjudication process. The system combines LLM-based theory agents, program synthesis, and information-theoretic experimental design in a closed loop. In a simulation study spanning three classic categorization theories, the framework recovered the ground-truth theory across noise settings with weaker reliability in the hardest settings. Together, the framework and findings provide a concrete proof of concept for closed-loop, in-silico theory adjudication in cognitive science.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.