Understanding Iterative Revision from Human-Written Text

Wanyu Du; Vipul Raheja; Dhruv Kumar; Zae Myung Kim; Melissa Lopez,; Dongyeop Kang

arXiv:2203.03802·cs.CL·September 27, 2022

Understanding Iterative Revision from Human-Written Text

Wanyu Du, Vipul Raheja, Dhruv Kumar, Zae Myung Kim, Melissa Lopez,, Dongyeop Kang

PDF

1 Repo 3 Models 4 Datasets

TL;DR

This paper introduces IteraTeR, a large-scale, multi-domain corpus of human-written, iteratively revised texts with annotated edit intentions, advancing understanding and modeling of the revision process across various writing contexts.

Contribution

It presents the first comprehensive, multi-domain corpus of annotated iterative revisions and demonstrates how incorporating edit intentions improves computational revision models.

Findings

01

Annotated edit intentions enhance model performance

02

IteraTeR covers diverse domains and revision granularities

03

Better understanding of revision quality and process

Abstract

Writing is, by nature, a strategic, adaptive, and more importantly, an iterative process. A crucial part of writing is editing and revising the text. Previous works on text revision have focused on defining edit intention taxonomies within a single domain or developing computational models with a single level of edit granularity, such as sentence-level edits, which differ from human's revision cycles. This work describes IteraTeR: the first large-scale, multi-domain, edit-intention annotated corpus of iteratively revised text. In particular, IteraTeR is collected based on a new framework to comprehensively model the iterative text revisions that generalize to various domains of formal writing, edit intentions, revision depths, and granularities. When we incorporate our annotated edit intentions, both generative and edit-based text revision models significantly improve automatic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vipulraheja/iterater
pytorchOfficial

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.