ABCD: A Graph Framework to Convert Complex Sentences to a Covering Set   of Simple Sentences

Yanjun Gao; Ting-hao Huang; Rebecca J. Passonneau

arXiv:2106.12027·cs.CL·July 1, 2021·1 cites

ABCD: A Graph Framework to Convert Complex Sentences to a Covering Set of Simple Sentences

Yanjun Gao, Ting-hao Huang, Rebecca J. Passonneau

PDF

Open Access 2 Repos

TL;DR

This paper introduces ABCD, a graph-based neural framework for decomposing complex sentences into simple, atomic sentences, enhancing understanding for NLP tasks with a new dataset and improved accuracy over baselines.

Contribution

The paper presents a novel graph edit approach for sentence decomposition, a new dataset DeSSE, and demonstrates improved performance over traditional parsing methods.

Findings

01

ABCD achieves comparable performance to parsing baselines on MinWiki.

02

On DeSSE, ABCD outperforms encoder-decoder models in atomic sentence count accuracy.

03

The approach enables effective complex sentence decomposition with detailed error analysis.

Abstract

Atomic clauses are fundamental text units for understanding complex sentences. Identifying the atomic sentences within complex sentences is important for applications such as summarization, argument mining, discourse analysis, discourse parsing, and question answering. Previous work mainly relies on rule-based methods dependent on parsing. We propose a new task to decompose each complex sentence into simple sentences derived from the tensed clauses in the source, and a novel problem formulation as a graph edit task. Our neural model learns to Accept, Break, Copy or Drop elements of a graph that combines word adjacency and grammatical dependencies. The full processing pipeline includes modules for graph construction, graph editing, and sentence generation from the output graph. We introduce DeSSE, a new dataset designed to train and evaluate complex sentence decomposition, and MinWiki, a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification