Your Language Model May Think Too Rigidly: Achieving Reasoning   Consistency with Symmetry-Enhanced Training

Yihang Yao; Zhepeng Cen; Miao Li; William Han; Yuyou Zhang; Emerson; Liu; Zuxin Liu; Chuang Gan; Ding Zhao

arXiv:2502.17800·cs.CL·February 26, 2025

Your Language Model May Think Too Rigidly: Achieving Reasoning Consistency with Symmetry-Enhanced Training

Yihang Yao, Zhepeng Cen, Miao Li, William Han, Yuyou Zhang, Emerson, Liu, Zuxin Liu, Chuang Gan, Ding Zhao

PDF

Open Access

TL;DR

This paper introduces a data augmentation method called MEND that enhances large language models' reasoning consistency by improving their robustness to query variations through symmetry-aware training, leading to better generalization.

Contribution

The paper proposes a novel symmetry-enhanced data augmentation technique that improves LLM robustness and reasoning performance across varied query phrasings, focusing on knowledge extraction stages.

Findings

01

MEND improves reasoning accuracy across diverse query variations.

02

The approach enhances model robustness to out-of-distribution data.

03

Experiments show better generalization in logical and arithmetic reasoning tasks.

Abstract

Large Language Models (LLMs) have demonstrated strong reasoning capabilities across various tasks. However, even minor variations in query phrasing, despite preserving the underlying semantic meaning, can significantly affect their performance. To address this, we focus on enhancing LLMs' awareness of symmetry in query variations and propose syMmetry-ENhanceD (MEND) Data Augmentation, a data-centric approach that improves the model's ability to extract useful information from context. Unlike existing methods that emphasize reasoning chain augmentation, our approach improves model robustness at the knowledge extraction stage through query augmentations, enabling more data-efficient training and stronger generalization to Out-of-Distribution (OOD) settings. Extensive experiments on both logical and arithmetic reasoning tasks show that MEND enhances reasoning performance across diverse…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Semantic Web and Ontologies

MethodsMODEL EDITOR NETWORKS WITH GRADIENT DECOMPOSITION · Focus