"Sharks are not the threat humans are": Argument Component Segmentation   in School Student Essays

Tariq Alhindi; Debanjan Ghosh

arXiv:2103.04518·cs.CL·March 9, 2021·6 cites

"Sharks are not the threat humans are": Argument Component Segmentation in School Student Essays

Tariq Alhindi, Debanjan Ghosh

PDF

Open Access

TL;DR

This paper explores token-level classification methods, especially BERT-based models, for segmenting argument components in middle school essays, improving argument mining accuracy.

Contribution

It introduces a novel BERT-based multi-task learning approach for argument component segmentation in student essays, outperforming previous models.

Findings

01

BERT-based models achieve the highest accuracy in argument segmentation.

02

Multi-task learning improves token and sentence classification performance.

03

The approach is effective on a new corpus of middle school essays.

Abstract

Argument mining is often addressed by a pipeline method where segmentation of text into argumentative units is conducted first and proceeded by an argument component identification task. In this research, we apply a token-level classification to identify claim and premise tokens from a new corpus of argumentative essays written by middle school students. To this end, we compare a variety of state-of-the-art models such as discrete features and deep learning architectures (e.g., BiLSTM networks and BERT-based architectures) to identify the argument components. We demonstrate that a BERT-based multi-task learning architecture (i.e., token and sentence level classification) adaptively pretrained on a relevant unlabeled dataset obtains the best results

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Software Engineering Research

MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory · Bidirectional LSTM