Gradient-based inference of abstract task representations for   generalization in neural networks

Ali Hummos; Felipe del R\'io; Brabeeba Mien Wang; Julio Hurtado,; Cristian B. Calderon; Guangyu Robert Yang

arXiv:2407.17356·cs.LG·July 25, 2024

Gradient-based inference of abstract task representations for generalization in neural networks

Ali Hummos, Felipe del R\'io, Brabeeba Mien Wang, Julio Hurtado,, Cristian B. Calderon, Guangyu Robert Yang

PDF

TL;DR

This paper introduces gradient-based inference (GBI), a method enabling neural networks to infer and manipulate abstract task representations, leading to improved generalization, learning efficiency, and robustness in adapting to new tasks.

Contribution

The paper proposes a novel gradient-based inference approach grounded in variational inference and EM framework, allowing neural networks to infer and recompose task abstractions for better generalization.

Findings

01

GBI improves learning efficiency and generalization in neural networks.

02

GBI helps limit forgetting and adapt to new tasks.

03

GBI preserves information for uncertainty estimation and out-of-distribution detection.

Abstract

Humans and many animals show remarkably adaptive behavior and can respond differently to the same input depending on their internal goals. The brain not only represents the intermediate abstractions needed to perform a computation but also actively maintains a representation of the computation itself (task abstraction). Such separation of the computation and its abstraction is associated with faster learning, flexible decision-making, and broad generalization capacity. We investigate if such benefits might extend to neural networks trained with task abstractions. For such benefits to emerge, one needs a task inference mechanism that possesses two crucial abilities: First, the ability to infer abstract task representations when no longer explicitly provided (task inference), and second, manipulate task representations to adapt to novel problems (task recomposition). To tackle this, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsVariational Inference