TATL at W-NUT 2020 Task 2: A Transformer-based Baseline System for   Identification of Informative COVID-19 English Tweets

Anh Tuan Nguyen

arXiv:2008.12854·cs.CL·September 1, 2020

TATL at W-NUT 2020 Task 2: A Transformer-based Baseline System for Identification of Informative COVID-19 English Tweets

Anh Tuan Nguyen

PDF

Open Access

TL;DR

This paper introduces a Transformer-based baseline system for identifying informative COVID-19 English Tweets, achieving competitive results in the W-NUT 2020 shared task.

Contribution

It presents a simple, effective Transformer-based approach for classifying informative COVID-19 tweets, demonstrating strong performance in a shared task setting.

Findings

01

Ranked 8th out of 56 teams in the leaderboard

02

Achieved competitive results with a simple Transformer model

03

Proposed approach is effective for COVID-19 tweet classification

Abstract

As the COVID-19 outbreak continues to spread throughout the world, more and more information about the pandemic has been shared publicly on social media. For example, there are a huge number of COVID-19 English Tweets daily on Twitter. However, the majority of those Tweets are uninformative, and hence it is important to be able to automatically select only the informative ones for downstream applications. In this short paper, we present our participation in the W-NUT 2020 Shared Task 2: Identification of Informative COVID-19 English Tweets. Inspired by the recent advances in pretrained Transformer language models, we propose a simple yet effective baseline for the task. Despite its simplicity, our proposed approach shows very competitive results in the leaderboard as we ranked 8 over 56 teams participated in total.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Misinformation and Its Impacts

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Attention Is All You Need · Byte Pair Encoding · Adam · Dropout · Label Smoothing · Multi-Head Attention · Residual Connection