Constraint Back-translation Improves Complex Instruction Following of   Large Language Models

Yunjia Qi; Hao Peng; Xiaozhi Wang; Bin Xu; Lei Hou; Juanzi Li

arXiv:2410.24175·cs.CL·April 30, 2025

Constraint Back-translation Improves Complex Instruction Following of Large Language Models

Yunjia Qi, Hao Peng, Xiaozhi Wang, Bin Xu, Lei Hou, Juanzi Li

PDF

Open Access 1 Repo 6 Models 1 Datasets

TL;DR

This paper introduces constraint back-translation, a novel data augmentation method that enhances large language models' ability to follow complex instructions by generating high-quality instruction-response pairs with implicit constraints.

Contribution

The paper proposes constraint back-translation to create better training data, significantly improving instruction-following performance of large language models.

Findings

01

Post-training on CRAB dataset improves instruction-following accuracy.

02

Constraint back-translation reduces data noise and costs.

03

Auxiliary training with constraint back-translation enhances model performance.

Abstract

Large language models (LLMs) struggle to follow instructions with complex constraints in format, length, etc. Following the conventional instruction-tuning practice, previous works conduct post-training on complex instruction-response pairs generated by feeding complex instructions to advanced LLMs. However, even advanced LLMs cannot follow complex instructions well, thus limiting the quality of generated data. In this work, we find that existing datasets inherently contain implicit complex constraints and propose a novel data generation technique, constraint back-translation. Specifically, we take the high-quality instruction-response pairs in existing datasets and only adopt advanced LLMs to add complex constraints already met by the responses to the instructions, which naturally reduces costs and data noise. In the experiments, we adopt Llama3-70B-Instruct to back-translate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

thu-keg/crab
noneOfficial

Models

Datasets

THU-KEG/Crab-SFT
dataset· 53 dl
53 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling