Analysis of first prototype universal intelligence tests: evaluating and comparing AI algorithms and humans
Javier Insa-Cabrera, Jose Hernandez-Orallo

TL;DR
This paper introduces a new universal intelligence testing method designed to evaluate and compare the general intelligence of various systems, including AI algorithms and humans, on a common scale.
Contribution
It presents a novel evaluation framework that assesses general intelligence across different systems, moving beyond task-specific performance measures.
Findings
Prototypes of universal intelligence tests were designed and applied to AI and humans.
Results showed the importance of calibration for consistent comparisons.
Proposed refinements aim to develop a truly universal intelligence test.
Abstract
Today, available methods that assess AI systems are focused on using empirical techniques to measure the performance of algorithms in some specific tasks (e.g., playing chess, solving mazes or land a helicopter). However, these methods are not appropriate if we want to evaluate the general intelligence of AI and, even less, if we compare it with human intelligence. The ANYNT project has designed a new method of evaluation that tries to assess AI systems using well known computational notions and problems which are as general as possible. This new method serves to assess general intelligence (which allows us to learn how to solve any new kind of problem we face) and not only to evaluate performance on a set of specific tasks. This method not only focuses on measuring the intelligence of algorithms, but also to assess any intelligent system (human beings, animals, AI, aliens?,...), and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputability, Logic, AI Algorithms · Evolutionary Algorithms and Applications · AI-based Problem Solving and Planning
