Enhancing Complex Instruction Following for Large Language Models with Mixture-of-Contexts Fine-tuning

Yuheng Lu; ZiMeng Bai; Caixia Yuan; Huixing Jiang; Xiaojie Wang

arXiv:2505.11922·cs.CL·May 20, 2025

TL;DR

This paper introduces MISO, a mixture-of-contexts fine-tuning method for large language models that improves their ability to follow complex, multi-constraint instructions by processing parallel subcontexts, leading to better instruction adherence and training efficiency.

Contribution

The paper proposes MISO, a novel extension to transformer-based LLMs that jointly considers subcontexts during fine-tuning to enhance complex instruction following.

Findings

1. MISO outperforms existing fine-tuning methods in complex instruction scenarios.

2. MISO improves training efficiency for large language models.

3. Empirical results show enhanced instruction-following accuracy.

Abstract

Large language models (LLMs) exhibit remarkable capabilities in handling natural language tasks; however, they may struggle to consistently follow complex instructions, including those that involve multiple constraints. Post-training LLMs using supervised fine-tuning (SFT) is a standard approach to improving their ability to follow instructions. In addressing complex instruction following, existing efforts primarily focus on data-driven methods that synthesize complex instruction-output pairs for SFT. However, insufficient attention allocated to crucial subcontexts may reduce the effectiveness of SFT. In this work, we propose transforming a sequentially structured input instruction into multiple parallel instructions containing subcontexts. To support processing this multi-input, we propose MISO (Multi-Input Single-Output), an extension to currently dominant decoder-only transformer-based LLMs.…
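
The input transformation described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the names `ParallelInputs`, `split_into_subcontexts`, and `to_sequential` are hypothetical, and the rule that each parallel input pairs the shared task with exactly one constraint is an assumption made for illustration. The abstract does not specify how subcontexts are chosen or how MISO combines them inside the model.

```python
from dataclasses import dataclass


@dataclass
class ParallelInputs:
    """Hypothetical container for the multi-input form of one instruction."""
    shared_task: str
    sub_instructions: list[str]


def to_sequential(task: str, constraints: list[str]) -> str:
    """Conventional single-sequence prompt: task followed by every constraint."""
    return task + "\n" + "\n".join(f"Constraint: {c}" for c in constraints)


def split_into_subcontexts(task: str, constraints: list[str]) -> ParallelInputs:
    """Turn one sequential instruction into several parallel instructions.

    Assumption (not from the paper): each parallel input holds the shared
    task plus exactly one constraint as its subcontext.
    """
    subs = [f"{task}\nConstraint: {c}" for c in constraints]
    return ParallelInputs(shared_task=task, sub_instructions=subs)


if __name__ == "__main__":
    task = "Write a product summary."
    constraints = ["Use at most 50 words.", "Do not mention price."]
    print(to_sequential(task, constraints))
    for sub in split_into_subcontexts(task, constraints).sub_instructions:
        print("---")
        print(sub)
```

In this sketch the sequential form places all constraints in one context, while the parallel form gives each constraint its own shorter input. A multi-input model would then need some mechanism to produce a single output from those inputs; that mechanism is the part MISO contributes and is not modeled here.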

Taxonomy

Topics: Topic Modeling · Text Readability and Simplification · Natural Language Processing Techniques

Methods: Softmax · Attention Is All You Need · Focus · Shrink and Fine-Tune