On a Built-in Conflict between Deep Learning and Systematic   Generalization

Yuanpeng Li

arXiv:2208.11633·cs.LG·August 25, 2022

On a Built-in Conflict between Deep Learning and Systematic Generalization

Yuanpeng Li

PDF

Open Access 1 Repo

TL;DR

This paper investigates a fundamental conflict in deep learning where internal function sharing hampers systematic generalization, revealing that common models tend to reuse decision boundaries, which limits their ability to generalize systematically.

Contribution

The study identifies internal function sharing as a key factor weakening systematic generalization across various deep learning architectures.

Findings

01

Function sharing leads to fewer decision boundaries for new outputs.

02

Standard models like CNNs, Transformers, and LSTMs exhibit this conflict.

03

Insights may guide future research to improve systematic generalization.

Abstract

In this paper, we hypothesize that internal function sharing is one of the reasons to weaken o.o.d. or systematic generalization in deep learning for classification tasks. Under equivalent prediction, a model partitions an input space into multiple parts separated by boundaries. The function sharing prefers to reuse boundaries, leading to fewer parts for new outputs, which conflicts with systematic generalization. We show such phenomena in standard deep learning models, such as fully connected, convolutional, residual networks, LSTMs, and (Vision) Transformers. We hope this study provides novel insights into systematic generalization and forms a basis for new research directions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yuanpeng16/bcdlsg
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications