LESA: Linguistic Encapsulation and Semantic Amalgamation Based   Generalised Claim Detection from Online Content

Shreya Gupta; Parantak Singh; Megha Sundriyal; Md Shad Akhtar; Tanmoy; Chakraborty

arXiv:2101.11891·cs.CL·January 29, 2021

LESA: Linguistic Encapsulation and Semantic Amalgamation Based Generalised Claim Detection from Online Content

Shreya Gupta, Parantak Singh, Megha Sundriyal, Md Shad Akhtar, Tanmoy, Chakraborty

PDF

1 Repo

TL;DR

LESA is a novel framework for claim detection in online content that combines syntactic and contextual features to improve accuracy across diverse datasets, including a newly annotated Twitter dataset.

Contribution

LESA introduces a source-independent model utilizing syntactic and contextual embeddings, and provides a new annotated Twitter dataset for claim detection research.

Findings

01

LESA outperforms state-of-the-art models by 3 claim-F1 points in in-domain tests.

02

LESA improves general-domain claim detection by 2 claim-F1 points.

03

The new Twitter dataset enhances testing on unstructured online content.

Abstract

The conceptualization of a claim lies at the core of argument mining. The segregation of claims is complex, owing to the divergence in textual syntax and context across different distributions. Another pressing issue is the unavailability of labeled unstructured text for experimentation. In this paper, we propose LESA, a framework which aims at advancing headfirst into expunging the former issue by assembling a source-independent generalized model that captures syntactic features through part-of-speech and dependency embeddings, as well as contextual features through a fine-tuned language model. We resolve the latter issue by annotating a Twitter dataset which aims at providing a testing ground on a large unstructured dataset. Experimental results show that LESA improves upon the state-of-the-art performance across six benchmark claim datasets by an average of 3 claim-F1 points for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

LCS2-IIITD/LESA-EACL-2021
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.