NAP at SemEval-2023 Task 3: Is Less Really More? (Back-)Translation as   Data Augmentation Strategies for Detecting Persuasion Techniques

Neele Falk; Annerose Eichel; Prisca Piccirilli

arXiv:2304.14179·cs.CL·April 28, 2023·1 cites

NAP at SemEval-2023 Task 3: Is Less Really More? (Back-)Translation as Data Augmentation Strategies for Detecting Persuasion Techniques

Neele Falk, Annerose Eichel, Prisca Piccirilli

PDF

Open Access

TL;DR

This paper explores the effectiveness of (back-)translation data augmentation strategies using multi-lingual transformers for detecting persuasion techniques in news, highlighting the importance of balancing human and machine-generated data.

Contribution

It demonstrates that (back-)translation enhances persuasion detection performance and provides insights into optimal data balancing for multi-lingual models.

Findings

01

Both data augmentation strategies improve detection accuracy.

02

Balancing human and machine-generated data is crucial.

03

Augmented data benefits are confirmed through automatic and human evaluations.

Abstract

Persuasion techniques detection in news in a multi-lingual setup is non-trivial and comes with challenges, including little training data. Our system successfully leverages (back-)translation as data augmentation strategies with multi-lingual transformer models for the task of detecting persuasion techniques. The automatic and human evaluation of our augmented data allows us to explore whether (back-)translation aid or hinder performance. Our in-depth analyses indicate that both data augmentation strategies boost performance; however, balancing human-produced and machine-generated data seems to be crucial.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMisinformation and Its Impacts · Sentiment Analysis and Opinion Mining · Advanced Text Analysis Techniques