SkipFlow: Incorporating Neural Coherence Features for End-to-End   Automatic Text Scoring

Yi Tay; Minh C. Phan; Luu Anh Tuan; Siu Cheung Hui

arXiv:1711.04981·cs.AI·November 15, 2017·19 cites

SkipFlow: Incorporating Neural Coherence Features for End-to-End Automatic Text Scoring

Yi Tay, Minh C. Phan, Luu Anh Tuan, Siu Cheung Hui

PDF

Open Access 1 Repo

TL;DR

This paper introduces SkipFlow, a neural architecture that enhances automatic text scoring by modeling coherence features through relationships between hidden states in an LSTM, achieving state-of-the-art results.

Contribution

The paper proposes a novel SkipFlow mechanism that captures neural coherence features, improving long-sequence understanding in end-to-end automatic text scoring models.

Findings

01

Achieved state-of-the-art performance on the ASAP dataset.

02

Outperformed feature engineering and other deep learning models.

03

Enhanced LSTM with auxiliary neural coherence features.

Abstract

Deep learning has demonstrated tremendous potential for Automatic Text Scoring (ATS) tasks. In this paper, we describe a new neural architecture that enhances vanilla neural network models with auxiliary neural coherence features. Our new method proposes a new \textsc{SkipFlow} mechanism that models relationships between snapshots of the hidden representations of a long short-term memory (LSTM) network as it reads. Subsequently, the semantic relationships between multiple snapshots are used as auxiliary features for prediction. This has two main benefits. Firstly, essays are typically long sequences and therefore the memorization capability of the LSTM network may be insufficient. Implicit access to multiple snapshots can alleviate this problem by acting as a protection against vanishing gradients. The parameters of the \textsc{SkipFlow} mechanism also acts as an auxiliary memory.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

carsonyang518/aaai24-aes-afg
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory