An architecture for the evaluation of intelligent systems

Javier Insa-Cabrera; Jose Hernandez-Orallo

arXiv:1102.0714·cs.AI·February 4, 2011

An architecture for the evaluation of intelligent systems

Javier Insa-Cabrera, Jose Hernandez-Orallo

PDF

Open Access

TL;DR

This paper proposes an architecture for evaluating artificial agents' intelligence by creating an interpreter capable of running diverse environments, including random ones, to measure and compare agent performance reliably.

Contribution

It introduces a novel interpreter framework that can execute various environments and assess agent intelligence based on their interactions and performance.

Findings

01

Developed an interpreter for multiple environment types

02

Enabled measurement of agent intelligence across environments

03

Facilitated comparison of different agents' intelligence levels

Abstract

One of the main research areas in Artificial Intelligence is the coding of agents (programs) which are able to learn by themselves in any situation. This means that agents must be useful for purposes other than those they were created for, as, for example, playing chess. In this way we try to get closer to the pristine goal of Artificial Intelligence. One of the problems to decide whether an agent is really intelligent or not is the measurement of its intelligence, since there is currently no way to measure it in a reliable way. The purpose of this project is to create an interpreter that allows for the execution of several environments, including those which are generated randomly, so that an agent (a person or a program) can interact with them. Once the interaction between the agent and the environment is over, the interpreter will measure the intelligence of the agent according to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsKnowledge Societies in the 21st Century · Educational Innovations and Technology