Bongard-LOGO: A New Benchmark for Human-Level Concept Learning and   Reasoning

Weili Nie; Zhiding Yu; Lei Mao; Ankit B. Patel; Yuke Zhu; Animashree; Anandkumar

arXiv:2010.00763·cs.AI·January 6, 2021·20 cites

Bongard-LOGO: A New Benchmark for Human-Level Concept Learning and Reasoning

Weili Nie, Zhiding Yu, Lei Mao, Ankit B. Patel, Yuke Zhu, Animashree, Anandkumar

PDF

Open Access 1 Repo 1 Video

TL;DR

Bongard-LOGO is a new benchmark designed to evaluate human-level concept learning and reasoning in visual cognition, highlighting gaps in current AI models and emphasizing core properties like context-dependence and analogy-making.

Contribution

The paper introduces Bongard-LOGO, a large, human-interpretable visual cognition benchmark generated with a program-guided technique, capturing key properties of human cognition for AI evaluation.

Findings

01

Deep learning models perform worse than humans on the benchmark.

02

The benchmark captures core properties of human cognition such as context-dependence and analogy-making.

03

Current AI models struggle to replicate human-like concept learning and reasoning.

Abstract

Humans have an inherent ability to learn novel concepts from only a few samples and generalize these concepts to different situations. Even though today's machine learning models excel with a plethora of training data on standard recognition tasks, a considerable gap exists between machine-level pattern recognition and human-level concept learning. To narrow this gap, the Bongard problems (BPs) were introduced as an inspirational challenge for visual cognition in intelligent systems. Despite new advances in representation learning and learning to learn, BPs remain a daunting challenge for modern AI. Inspired by the original one hundred BPs, we propose a new benchmark Bongard-LOGO for human-level concept learning and reasoning. We develop a program-guided generation technique to produce a large set of human-interpretable visual cognition problems in action-oriented LOGO language. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

NVlabs/Bongard-LOGO
pytorchOfficial

Videos

Bongard-LOGO: A New Benchmark for Human-Level Concept Learning and Reasoning· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Machine Learning and Data Classification