Measuring an Artificial Intelligence System's Performance on a Verbal IQ   Test For Young Children

Stellan Ohlsson; Robert H. Sloan; Gy\"orgy Tur\'an; Aaron Urasky

arXiv:1509.03390·cs.AI·September 14, 2015·1 cites

Measuring an Artificial Intelligence System's Performance on a Verbal IQ Test For Young Children

Stellan Ohlsson, Robert H. Sloan, Gy\"orgy Tur\'an, Aaron Urasky

PDF

Open Access

TL;DR

This study evaluates an AI system's verbal IQ using a standard children's test, revealing strengths in vocabulary and similarities but weaknesses in comprehension, highlighting areas for AI improvement.

Contribution

First application of a children's verbal IQ test to an AI system, demonstrating how standard psychometric assessments can evaluate AI language understanding.

Findings

01

AI scored average for 4-year-olds, below for 5-7 years.

02

Strengths in Vocabulary and Similarities subtests.

03

Weaknesses in Comprehension and Word Reasoning.

Abstract

We administered the Verbal IQ (VIQ) part of the Wechsler Preschool and Primary Scale of Intelligence (WPPSI-III) to the ConceptNet 4 AI system. The test questions (e.g., "Why do we shake hands?") were translated into ConceptNet 4 inputs using a combination of the simple natural language processing tools that come with ConceptNet together with short Python programs that we wrote. The question answering used a version of ConceptNet based on spectral methods. The ConceptNet system scored a WPPSI-III VIQ that is average for a four-year-old child, but below average for 5 to 7 year-olds. Large variations among subtests indicate potential areas of improvement. In particular, results were strongest for the Vocabulary and Similarities subtests, intermediate for the Information subtest, and lowest for the Comprehension and Word Reasoning subtests. Comprehension is the subtest most strongly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCognitive Abilities and Testing · Child and Animal Learning Development · Cognitive Science and Mapping