An architecture for the evaluation of intelligent systems
Javier Insa-Cabrera, Jose Hernandez-Orallo

TL;DR
This paper proposes an architecture for evaluating artificial agents' intelligence by creating an interpreter capable of running diverse environments, including random ones, to measure and compare agent performance reliably.
Contribution
It introduces a novel interpreter framework that can execute various environments and assess agent intelligence based on their interactions and performance.
Findings
Developed an interpreter for multiple environment types
Enabled measurement of agent intelligence across environments
Facilitated comparison of different agents' intelligence levels
Abstract
One of the main research areas in Artificial Intelligence is the coding of agents (programs) which are able to learn by themselves in any situation. This means that agents must be useful for purposes other than those they were created for, as, for example, playing chess. In this way we try to get closer to the pristine goal of Artificial Intelligence. One of the problems to decide whether an agent is really intelligent or not is the measurement of its intelligence, since there is currently no way to measure it in a reliable way. The purpose of this project is to create an interpreter that allows for the execution of several environments, including those which are generated randomly, so that an agent (a person or a program) can interact with them. Once the interaction between the agent and the environment is over, the interpreter will measure the intelligence of the agent according to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsKnowledge Societies in the 21st Century · Educational Innovations and Technology
