Investigating More Explainable and Partition-Free Compositionality Estimation for LLMs: A Rule-Generation Perspective

Ziyao Xu; Cong Wang; Houfeng Wang

arXiv:2604.27340·cs.AI·May 1, 2026

Investigating More Explainable and Partition-Free Compositionality Estimation for LLMs: A Rule-Generation Perspective

Ziyao Xu, Cong Wang, Houfeng Wang

PDF

TL;DR

This paper introduces a rule-generation approach to evaluate LLMs' compositionality, addressing explainability and dataset leakage issues in traditional tests, and offers new insights into LLMs' compositional understanding.

Contribution

It proposes a novel rule-generation perspective for compositionality estimation that overcomes limitations of existing tests and provides a new analysis framework for LLMs.

Findings

01

LLMs exhibit various compositionality characterizations.

02

The proposed method reveals compositionality deficiencies in advanced LLMs.

03

Experiments on a string-to-grid task demonstrate the effectiveness of the approach.

Abstract

Compositional generalization tests are often used to estimate the compositionality of LLMs. However, such tests have the following limitations: (1) they only focus on the output results without considering LLMs' understanding of sample compositionality, resulting in explainability defects; (2) they rely on dataset partition to form the test set with combinations unseen in the training set, suffering from combination leakage issues. In this work, we propose a novel rule-generation perspective for compositionality estimation for LLMs. It requires LLMs to generate a program as rules for dataset mapping and provides estimates of the compositionality of LLMs using complexity-based theory. The perspective addresses the limitations of compositional generalization tests and provides a new way to analyze the compositionality characterization of LLMs. We conduct experiments and analysis of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.