UltraIF: Advancing Instruction Following from the Wild

Kaikai An; Li Sheng; Ganqu Cui; Shuzheng Si; Ning Ding; Yu Cheng; Baobao Chang

arXiv:2502.04153·cs.CL·September 30, 2025

UltraIF: Advancing Instruction Following from the Wild

Kaikai An, Li Sheng, Ganqu Cui, Shuzheng Si, Ning Ding, Yu Cheng, Baobao Chang

PDF

Open Access 1 Repo 3 Models 2 Datasets

TL;DR

UltraIF is a scalable method that decomposes complex instructions into simpler components, enabling open-source LLMs like LLaMA-3.1-8B to match the instruction-following performance of proprietary models without additional benchmark data.

Contribution

We introduce UltraIF, a novel approach that decomposes and synthesizes complex instructions, significantly improving open-source LLMs' instruction-following capabilities without relying on benchmark data.

Findings

01

LLaMA-3.1-8B-Base matches instruct models on 5 benchmarks

02

UltraIF improves LLaMA-3.1-8B-Base's instruction-following performance

03

Self-alignment with UltraIF further enhances LLaMA-3.1-8B-Instruct

Abstract

Instruction-following made modern large language models (LLMs) helpful assistants. However, the key to taming LLMs on complex instructions remains mysterious, for that there are huge gaps between models trained by open-source community and those trained by leading companies. To bridge the gap, we propose a simple and scalable approach UltraIF for building LLMs that can follow complex instructions with open-source data. UltraIF first decomposes real-world user prompts into simpler queries, constraints, and corresponding evaluation questions for the constraints. Then, we train an UltraComposer to compose constraint-associated prompts with evaluation questions. This prompt composer allows us to synthesize complicated instructions as well as filter responses with evaluation questions. In our experiment, for the first time, we successfully align LLaMA-3.1-8B-Base to catch up with its…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kkk-an/ultraif
pytorchOfficial

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReflective Practices in Education · Experimental Learning in Engineering

MethodsALIGN