Towards A Measure Of General Machine Intelligence

Gautham Venkatasubramanian; Sibesh Kar; Abhimanyu Singh; Shubham; Mishra; Dushyant Yadav; Shreyansh Chandak

arXiv:2109.12075·cs.AI·May 25, 2022

Towards A Measure Of General Machine Intelligence

Gautham Venkatasubramanian, Sibesh Kar, Abhimanyu Singh, Shubham, Mishra, Dushyant Yadav, Shreyansh Chandak

PDF

Open Access 2 Repos

TL;DR

This paper introduces a new benchmark, the g-index, to measure how efficiently AI systems acquire new skills across diverse real-world tasks, emphasizing generalization ability over brute-force training.

Contribution

It proposes a universal instruction language and a method to quantify task difficulty and skill acquisition efficiency, enabling comparison of AI systems' general intelligence capabilities.

Findings

01

The g-index effectively measures skill acquisition efficiency.

02

Current models show varying g-index scores indicating different levels of generalization.

03

The proposed benchmark can evaluate AI systems across multiple domains.

Abstract

To build general-purpose artificial intelligence systems that can deal with unknown variables across unknown domains, we need benchmarks that measure how well these systems perform on tasks they have never seen before. A prerequisite for this is a measure of a task's generalization difficulty, or how dissimilar it is from the system's prior knowledge and experience. If the skill of an intelligence system in a particular domain is defined as it's ability to consistently generate a set of instructions (or programs) to solve tasks in that domain, current benchmarks do not quantitatively measure the efficiency of acquiring new skills, making it possible to brute-force skill acquisition by training with unlimited amounts of data and compute power. With this in mind, we first propose a common language of instruction, a programming language that allows the expression of programs in the form of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputability, Logic, AI Algorithms · Teaching and Learning Programming · Machine Learning and Data Classification