JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning

Chang Gao; Wenxuan Zhang; Guizhen Chen; Wai Lam

arXiv:2310.02953·cs.CL·June 10, 2025

JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning

Chang Gao, Wenxuan Zhang, Guizhen Chen, Wai Lam

PDF

Open Access 1 Repo 3 Reviews

TL;DR

JsonTuning introduces a structured approach using JSON to improve instruction tuning of large language models, enhancing their generalization, robustness, and controllability over traditional text-based methods.

Contribution

The paper proposes JsonTuning, a novel structure-to-structure instruction tuning method that explicitly encodes task information to outperform existing text-to-text approaches.

Findings

01

JsonTuning outperforms TextTuning in various benchmarks.

02

JsonTuning improves model robustness and controllability.

03

Extensive analysis confirms the effectiveness of JSON-based task representation.

Abstract

Instruction tuning is vital for enhancing the performance of large language models (LLMs), but existing text-to-text methods, referred to as TextTuning, struggle with issues such as generalization, robustness, and controllability due to their lack of explicit task structures. We introduce JsonTuning, a structure-to-structure approach that uses JSON structures to represent tasks. This method improves generalization by clarifying task elements and their relations, boosts robustness by minimizing ambiguity, and enhances controllability by allowing precise control over outputs. We conduct an extensive comparative analysis between JsonTuning and TextTuning using various language models and benchmarks. Our findings reveal that JsonTuning consistently surpasses TextTuning in terms of performance, robustness, and controllability across different scenarios. By overcoming the limitations of…

Peer Reviews

Decision·Submitted to ICLR 2024

Reviewer 01Rating 6· marginally above the acceptance thresholdConfidence 3

Strengths

- Originality: The paper proposes JsonTuning, which is a new format of instruction tuning. It reformats standard instruction tuning data into JSON format to reduce ambiguity and improve controllability. The proposed method is novel in that it leverages the LLM's understanding of structured data format - JSON and makes use of it in downstream instruction tuning. - Clarity: The paper is well-written and easy to follow. - Quality: The claims in this paper are well supported by citations or empirica

Weaknesses

- Significance: the format of instruction tuning data is only one of the many system choices of the overall instruction tuning. Others include tasks, base model, domains, and languages. Since the paper only focuses on the data format on a selection of tasks, the significance is limited. - Soundness: It is not clear if a TextTuning model with candidate answers and output control in plain text would also perform as good as JsonTuning. In other words, how much gain was from the structured format of

Reviewer 02Rating 6· marginally above the acceptance thresholdConfidence 5

Strengths

- The paper is clear and easy to follow. - The method is simple and works well.

Weaknesses

- In my opinion this paper needs some sort of analysis of the number of additional tokens introduced by the JSON format for training and inference. The additional training cost is probably negligible and unimportant, but the additional FLOPS and encoded/decoded tokens for the JSON format will add up for inference. Note: I am not saying that the fact that JSON-formatted examples have extra tokens is a weakness, but this extra cost should at least be quantified in my opinion. - I generally don't

Reviewer 03Rating 5· marginally below the acceptance thresholdConfidence 4

Strengths

This paper provides a very simple method to convert the original instruction tuning into a unified Json format. The authors also conduct comprehensive comparison against baselines trained under text-to-text formulation, and show that JsonTuning can harvest better performance. The ablation studies help us to better understand the effect of the subparts such as label space and control information.

Weaknesses

Despite the commendable performance exhibited by JsonTuning, there are still several notable weaknesses: **Introduction of Additional Knowledge in Input**: The utilization of Json formatting can aid models in generating outputs that conform to the constraints specified in the Json input. However, it also introduces supplementary knowledge, such as information pertaining to input and output types, which is originally absent in the unstructured textual instructions. To facilitate a fair compariso

Code & Models

Repositories

gao-xiao-bai/jsontuning
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications