CARMA: Enhanced Compositionality in LLMs via Advanced Regularisation and Mutual Information Alignment
Nura Aljaafari, Danilo S. Carvalho, Andr\'e Freitas

TL;DR
CARMA is a scalable regularisation method that enhances compositional reasoning in large language models by improving stability and robustness of internal representations without sacrificing task performance.
Contribution
We introduce CARMA, a novel regularisation technique using mutual information and stability constraints to improve compositional generalisation in LLMs.
Findings
CARMA reduces variability in fine-tuned models.
It stabilises token representations across layers.
It improves compositional reasoning and robustness.
Abstract
Large language models (LLMs) struggle with compositional generalisation, limiting their ability to systematically combine learned components to interpret novel inputs. While architectural modifications, fine-tuning, and data augmentation improve compositionality, they often have limited adaptability, face scalability constraints, or yield diminishing returns on real data. To address this, we propose CARMA, an intervention that enhances the stability and robustness of compositional reasoning in LLMs while preserving fine-tuned performance. CARMA employs mutual information regularisation and layer-wise stability constraints to mitigate feature fragmentation, ensuring structured representations persist across and within layers. We evaluate CARMA on inverse dictionary modelling and sentiment classification, measuring its impact on semantic consistency, performance stability, and robustness…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTunneling and Rock Mechanics · Mineral Processing and Grinding
