Relational inductive biases, deep learning, and graph networks
Peter W. Battaglia, Jessica B. Hamrick, Victor Bapst, Alvaro, Sanchez-Gonzalez, Vinicius Zambaldi, Mateusz Malinowski, Andrea Tacchetti,, David Raposo, Adam Santoro, Ryan Faulkner, Caglar Gulcehre, Francis Song,, Andrew Ballard, Justin Gilmer, George Dahl, Ashish Vaswani

TL;DR
This paper advocates for the integration of relational inductive biases into deep learning architectures, introducing graph networks as a versatile tool to enhance relational reasoning, generalization, and structured knowledge manipulation in AI.
Contribution
It introduces graph networks as a new building block for AI, unifying and extending graph-based neural approaches with a focus on relational reasoning and generalization.
Findings
Graph networks support relational reasoning tasks.
They enable better generalization beyond training experiences.
Open-source library available for practical implementation.
Abstract
Artificial intelligence (AI) has undergone a renaissance recently, making major progress in key domains such as vision, language, control, and decision-making. This has been due, in part, to cheap data and cheap compute resources, which have fit the natural strengths of deep learning. However, many defining characteristics of human intelligence, which developed under much different pressures, remain out of reach for current approaches. In particular, generalizing beyond one's experiences--a hallmark of human intelligence from infancy--remains a formidable challenge for modern AI. The following is part position paper, part review, and part unification. We argue that combinatorial generalization must be a top priority for AI to achieve human-like abilities, and that structured representations and computations are key to realizing this objective. Just as biology uses nature and nurture…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
