CTL++: Evaluating Generalization on Never-Seen Compositional Patterns of   Known Functions, and Compatibility of Neural Representations

R\'obert Csord\'as; Kazuki Irie; J\"urgen Schmidhuber

arXiv:2210.06350·cs.LG·October 13, 2022

CTL++: Evaluating Generalization on Never-Seen Compositional Patterns of Known Functions, and Compatibility of Neural Representations

R\'obert Csord\'as, Kazuki Irie, J\"urgen Schmidhuber

PDF

Open Access 1 Repo

TL;DR

This paper introduces CTL++, a diagnostic dataset to evaluate neural networks' ability to generalize systematically to unseen compositions of known functions, revealing limitations of current models and insights into neural representations.

Contribution

The paper presents CTL++, a new dataset for testing systematic generalization of neural nets on function compositions, and analyzes neural representation compatibility and failure modes.

Findings

01

Recent Transformer variants fail on CTL++ tasks.

02

Neural networks require significant overlap in training groups to learn composition.

03

Learned symbol representations are compatible in successful cases but not in failures.

Abstract

Well-designed diagnostic tasks have played a key role in studying the failure of neural nets (NNs) to generalize systematically. Famous examples include SCAN and Compositional Table Lookup (CTL). Here we introduce CTL++, a new diagnostic dataset based on compositions of unary symbolic functions. While the original CTL is used to test length generalization or productivity, CTL++ is designed to test systematicity of NNs, that is, their capability to generalize to unseen compositions of known functions. CTL++ splits functions into groups and tests performance on group elements composed in a way not seen during training. We show that recent CTL-solving Transformer variants fail on CTL++. The simplicity of the task design allows for fine-grained control of task difficulty, as well as many insightful analyses. For example, we measure how much overlap between groups is needed by tested NNs for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

robertcsordas/ctlpp
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Ferroelectric and Negative Capacitance Devices

MethodsMulti-Head Attention · Attention Is All You Need · Test · Linear Layer · Byte Pair Encoding · Absolute Position Encodings · Layer Normalization · Position-Wise Feed-Forward Layer · Residual Connection · Dropout