Verifiable Format Control for Large Language Model Generations

Zhaoyang Wang; Jinqi Jiang; Huichi Zhou; Wenhao Zheng; Xuchao Zhang; Chetan Bansal; Huaxiu Yao

arXiv:2502.04498·cs.CL·February 10, 2025

TL;DR

This paper introduces a verifiable dataset and training method to improve small LLMs' ability to follow specific formats like JSON, addressing a key limitation in their instruction-following capabilities.

Contribution

It presents a fully verifiable dataset and a training approach that enhances small LLMs' format following abilities without relying on external LLMs for validation.

Findings

1. Small LLMs struggle with fine-grained format following.
2. The proposed method improves format following in 7B-level LLMs.
3. Verifiable dataset enables efficient training without costly API calls.

Abstract

Recent Large Language Models (LLMs) have demonstrated satisfactory general instruction-following ability. However, small LLMs with about 7B parameters still struggle with fine-grained format following (e.g., JSON format), which seriously hinders the advancement of their applications. Most existing methods focus on benchmarking general instruction following while overlooking how to improve the specific format-following ability of small LLMs. Besides, these methods often rely on evaluations based on advanced LLMs (e.g., GPT-4), which can introduce the intrinsic bias of LLMs and be costly due to API calls. In this paper, we first curate a fully verifiable format-following dataset, VFF. In contrast to existing works, which often adopt external LLMs for instruction-following validation, every sample of VFF can be easily validated with a Python function. Further, we propose to leverage this verifiable…
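
The abstract notes that every VFF sample can be validated with a Python function. As a rough illustration of what such a check might look like, here is a minimal sketch for one hypothetical format constraint: the response must be a single JSON object containing a given set of keys. The function name, its signature, and the constraint itself are assumptions made for illustration and are not taken from the VFF dataset.

```python
import json


def verify_json_format(response: str, required_keys: list[str]) -> bool:
    """Return True if `response` is a JSON object with all required keys.

    Hypothetical verifier: the name, signature, and constraint are
    illustrative assumptions, not the dataset's actual checks.
    """
    try:
        parsed = json.loads(response)
    except json.JSONDecodeError:
        # Any extra prose around the JSON makes the whole string unparseable.
        return False
    if not isinstance(parsed, dict):
        # Valid JSON, but not an object (e.g. a list or a bare number).
        return False
    return all(key in parsed for key in required_keys)


good = '{"name": "VFF", "size": 7}'
bad = 'Sure! Here is the JSON: {"name": "VFF"}'
print(verify_json_format(good, ["name", "size"]))  # True
print(verify_json_format(bad, ["name"]))           # False
```

A check of this kind is deterministic and runs locally, which is the property the abstract contrasts with evaluation by an external LLM such as GPT-4.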

Taxonomy

Topics: Natural Language Processing Techniques · Model-Driven Software Engineering Techniques · Topic Modeling