Constraint Back-translation Improves Complex Instruction Following of Large Language Models
Yunjia Qi, Hao Peng, Xiaozhi Wang, Bin Xu, Lei Hou, Juanzi Li

TL;DR
This paper introduces constraint back-translation, a novel data augmentation method that enhances large language models' ability to follow complex instructions by generating high-quality instruction-response pairs with implicit constraints.
Contribution
The paper proposes constraint back-translation to create better training data, significantly improving instruction-following performance of large language models.
Findings
Post-training on CRAB dataset improves instruction-following accuracy.
Constraint back-translation reduces data noise and costs.
Auxiliary training with constraint back-translation enhances model performance.
Abstract
Large language models (LLMs) struggle to follow instructions with complex constraints in format, length, etc. Following the conventional instruction-tuning practice, previous works conduct post-training on complex instruction-response pairs generated by feeding complex instructions to advanced LLMs. However, even advanced LLMs cannot follow complex instructions well, thus limiting the quality of generated data. In this work, we find that existing datasets inherently contain implicit complex constraints and propose a novel data generation technique, constraint back-translation. Specifically, we take the high-quality instruction-response pairs in existing datasets and only adopt advanced LLMs to add complex constraints already met by the responses to the instructions, which naturally reduces costs and data noise. In the experiments, we adopt Llama3-70B-Instruct to back-translate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗THU-KEG/Mistral-Crab-SFTmodel· 21 dl· ♡ 521 dl♡ 5
- 🤗THU-KEG/Mistral-Crab-DPOmodel· 7 dl· ♡ 47 dl♡ 4
- 🤗THU-KEG/Llama3-Crab-SFTmodel· 8 dl8 dl
- 🤗THU-KEG/Llama3-Crab-DPOmodel· 9 dl· ♡ 29 dl♡ 2
- 🤗QuantFactory/Mistral-Crab-SFT-GGUFmodel· 147 dl· ♡ 2147 dl♡ 2
- 🤗Mungert/Mistral-Crab-DPO-GGUFmodel· 72 dl· ♡ 172 dl♡ 1
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
