The Generalized Turing Test: A Foundation for Comparing Intelligence
Daniel Mitropolsky, Susan S. Hong, Riccardo Neumarker, Emanuele Rimoldi, Tomaso Poggio

TL;DR
The paper proposes the Generalized Turing Test (GTT), a formal, task-agnostic framework for comparing agent intelligence through indistinguishability, supported by empirical evaluations on modern models.
Contribution
It introduces the GTT framework, formalizes conditions for agent comparison, and empirically demonstrates its effectiveness across various models and trials.
Findings
The GTT framework produces meaningful empirical orderings of agent capabilities.
Indistinguishability-based comparisons align with existing rankings.
The framework is adaptable with variants like querying and bounded interactions.
Abstract
We introduce the Generalized Turing Test (GTT), a formal framework for comparing the capabilities of arbitrary agents via indistinguishability. For agents A and B, we define the Turing comparator A B to hold if B, acting as a distinguisher, cannot reliably distinguish between interactions with A (instructed to imitate B) and another instance of B. This yields a dataset- and task-agnostic notion of relative intelligence. We study the comparator's structure, including conditions under which it is transitive and therefore induces an ordering over equivalence classes, and we define and analyze variants with querying, bounded interaction, and fixed distinguishers. To complement the theory, we instantiate the framework on a collection of modern models, empirically evaluating pairwise indistinguishability across thousands of trials. The resulting comparisons exhibit a stratified…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
