Towards a Comparative Framework for Compositional AI Models
Tiffany Duneau

TL;DR
This paper introduces a category theory-based framework for evaluating compositional generalisation and interpretability in natural language processing models, comparing quantum and neural architectures on a dataset testing compositionality.
Contribution
It presents a framework-agnostic approach to assess compositional generalisation and interpretability, and applies it to compare quantum and neural models within the DisCoCirc framework.
Findings
Neural models tend to overfit more than quantum models.
Both architectures perform similarly on productivity and substitutivity tasks.
Quantum and neural models differ significantly on systematicity and overgeneralisation tasks.
Abstract
The DisCoCirc framework for natural language processing allows the construction of compositional models of text, by combining units for individual words together according to the grammatical structure of the text. The compositional nature of a model can give rise to two things: compositional generalisation -- the ability of a model to generalise outside its training distribution by learning compositional rules underpinning the entire data distribution -- and compositional interpretability -- making sense of how the model works by inspecting its modular components in isolation, as well as the processes through which these components are combined. We present these notions in a framework-agnostic way using the language of category theory, and adapt a series of tests for compositional generalisation to this setting. Applying this to the DisCoCirc framework, we consider how well a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
