MuSC: Improving Complex Instruction Following with Multi-granularity Self-Contrastive Training

Hui Huang; Jiaheng Liu; Yancheng He; Shilong Li; Bing Xu; Conghui Zhu; Muyun Yang; Tiejun Zhao

arXiv:2502.11541·cs.CL·June 2, 2025

MuSC: Improving Complex Instruction Following with Multi-granularity Self-Contrastive Training

Hui Huang, Jiaheng Liu, Yancheng He, Shilong Li, Bing Xu, Conghui Zhu, Muyun Yang, Tiejun Zhao

PDF

Open Access 1 Repo

TL;DR

MuSC introduces a novel training framework that enhances complex instruction-following in large language models by leveraging multi-granularity self-contrastive training, eliminating the need for stronger models like GPT-4.

Contribution

It proposes a multi-granularity self-contrastive training method that improves instruction alignment without relying on advanced models, using both coarse and fine granularity techniques.

Findings

01

Significant improvement on complex instruction-following benchmarks

02

Outperforms previous self-alignment methods

03

Effective on open-sourced models

Abstract

Complex instruction-following with elaborate constraints is imperative for Large Language Models (LLMs). While existing methods have constructed data for complex instruction alignment, they all rely on a more advanced model, especially GPT-4, limiting their application. In this paper, we propose a Multi-granularity Self-Contrastive Training (MuSC) framework, to improve the complex instruction alignment without relying on a stronger model. Our method is conducted on both coarse and fine granularity. On coarse-granularity, we construct constraint-aware preference data based on instruction decomposition and recombination. On fine-granularity, we perform token-aware preference optimization with dynamic token-level supervision. Our method is evaluated on open-sourced models, and experiment results show our method achieves significant improvement on both complex and general…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

huihuichyan/musc
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInnovative Teaching and Learning Methods · Problem and Project Based Learning · Education and Critical Thinking Development

MethodsAttention Is All You Need · Linear Layer · Byte Pair Encoding · Layer Normalization · Residual Connection · Dense Connections · Multi-Head Attention · Position-Wise Feed-Forward Layer · Adam · Softmax