XATU: A Fine-grained Instruction-based Benchmark for Explainable Text   Updates

Haopeng Zhang; Hayate Iso; Sairam Gurajada; Nikita Bhutani

arXiv:2309.11063·cs.CL·March 18, 2024·2 cites

XATU: A Fine-grained Instruction-based Benchmark for Explainable Text Updates

Haopeng Zhang, Hayate Iso, Sairam Gurajada, Nikita Bhutani

PDF

Open Access 1 Repo 1 Datasets

TL;DR

XATU is a new benchmark for fine-grained, explainable text editing tasks that evaluates large language models' capabilities across various editing challenges with interpretability.

Contribution

This paper introduces XATU, the first benchmark focusing on fine-grained, explainable text editing with diverse tasks and combined annotation methods for interpretability.

Findings

01

Instruction tuning improves editing performance.

02

Model architecture significantly affects results.

03

Explanations enhance model fine-tuning for text editing.

Abstract

Text editing is a crucial task of modifying text to better align with user intents. However, existing text editing benchmark datasets contain only coarse-grained instructions and lack explainability, thus resulting in outputs that deviate from the intended changes outlined in the gold reference. To comprehensively investigate the text editing capabilities of large language models (LLMs), this paper introduces XATU, the first benchmark specifically designed for fine-grained instruction-based explainable text editing. XATU considers finer-grained text editing tasks of varying difficulty (simplification, grammar check, fact-check, etc.), incorporating lexical, syntactic, semantic, and knowledge-intensive edit aspects. To enhance interpretability, we combine LLM-based annotation and human annotation, resulting in a benchmark that includes fine-grained instructions and gold-standard edit…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

megagonlabs/xatu
noneOfficial

Datasets

WhyDoYouCare/xatu_fruit
dataset· 9 dl
9 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Recommender Systems and Techniques

MethodsALIGN