CyberWallE at SemEval-2020 Task 11: An Analysis of Feature Engineering   for Ensemble Models for Propaganda Detection

Verena Blaschke; Maxim Korniyenko; Sam Tureski

arXiv:2008.09859·cs.CL·August 25, 2020·1 cites

CyberWallE at SemEval-2020 Task 11: An Analysis of Feature Engineering for Ensemble Models for Propaganda Detection

Verena Blaschke, Maxim Korniyenko, Sam Tureski

PDF

Open Access 1 Repo

TL;DR

This paper presents an ensemble approach with feature engineering for propaganda detection in news articles, achieving competitive rankings in SemEval-2020 tasks using BERT embeddings and lexical features.

Contribution

The paper introduces a novel ensemble model with feature engineering for propaganda detection, combining BERT embeddings and lexical features for improved performance.

Findings

01

Achieved 8th place in Span Identification subtask (F1: 43.86%)

02

Achieved 8th place in Technique Classification subtask (F1: 57.37%)

03

Demonstrated effectiveness of feature engineering with BERT in ensemble models.

Abstract

This paper describes our participation in the SemEval-2020 task Detection of Propaganda Techniques in News Articles. We participate in both subtasks: Span Identification (SI) and Technique Classification (TC). We use a bi-LSTM architecture in the SI subtask and train a complex ensemble model for the TC subtask. Our architectures are built using embeddings from BERT in combination with additional lexical features and extensive label post-processing. Our systems achieve a rank of 8 out of 35 teams in the SI subtask (F1-score: 43.86%) and 8 out of 31 teams in the TC subtask (F1-score: 57.37%).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cicl-iscl/CyberWallE-propaganda-detection
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMisinformation and Its Impacts · Topic Modeling · Sentiment Analysis and Opinion Mining

MethodsLinear Layer · Attention Dropout · Weight Decay · Adam · Dropout · WordPiece · Multi-Head Attention · Residual Connection · Refunds@Expedia|||How do I get a full refund from Expedia? · Softmax