Transformer Models for Text Coherence Assessment

Tushar Abhishek; Daksh Rawat; Manish Gupta; and Vasudeva Varma

arXiv:2109.02176·cs.CL·February 24, 2022

Transformer Models for Text Coherence Assessment

Tushar Abhishek, Daksh Rawat, Manish Gupta, and Vasudeva Varma

PDF

Open Access 2 Repos

TL;DR

This paper introduces four Transformer-based models for automated text coherence assessment, significantly improving accuracy over previous methods across multiple datasets and tasks.

Contribution

It proposes novel Transformer architectures tailored for coherence evaluation, addressing limitations of prior approaches and achieving state-of-the-art results.

Findings

01

Transformer models outperform existing methods

02

Hierarchical and multi-task models enhance performance

03

Models generalize well across domains and tasks

Abstract

Coherence is an important aspect of text quality and is crucial for ensuring its readability. It is essential desirable for outputs from text generation systems like summarization, question answering, machine translation, question generation, table-to-text, etc. An automated coherence scoring model is also helpful in essay scoring or providing writing feedback. A large body of previous work has leveraged entity-based methods, syntactic patterns, discourse relations, and more recently traditional deep learning architectures for text coherence assessment. Previous work suffers from drawbacks like the inability to handle long-range dependencies, out-of-vocabulary words, or model sequence information. We hypothesize that coherence assessment is a cognitively complex task that requires deeper models and can benefit from other related tasks. Accordingly, in this paper, we propose four…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText Readability and Simplification · Natural Language Processing Techniques · Topic Modeling

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Layer Normalization · Residual Connection · Adam · Dropout