Extended Multilingual Protest News Detection -- Shared Task 1, CASE 2021 and 2022
Ali Hürriyetoğlu, Osman Mutlu, Fırat Duruşan, Onur Uca, Alaeddin Selçuk Gürel, Benjamin Radford, Yaoyao Dai, Hansi Hettiarachchi, Niklas Stoehr, Tadashi Nomoto, Milena Slavcheva, Francielle Vargas, Aaqib Javid, Fatih Beyhan, Erdem Yörük

TL;DR
This paper reports the results of CASE 2022 Shared Task 1 on multilingual protest event detection, with an emphasis on zero-shot learning and ensemble methods.
Contribution
It extends previous work by including new languages and data, and demonstrates effective zero-shot classification using ensemble models and multilingual data merging.
Findings
The best systems achieved F1-macro scores between 79.71 and 84.06 on the new languages.
Ensembling models and multilingual data merging are effective strategies.
New submissions outperform the previous year's results in most subtasks.
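The two ideas named in these findings, ensembling and F1-macro scoring, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the function names `majority_vote` and `macro_f1`, the toy binary labels, and the choice of simple majority voting are hypothetical and are not taken from any participating system. The function returns F1-macro on a 0 to 1 scale, whereas the scores reported above are on a 0 to 100 scale.

```python
from collections import Counter


def majority_vote(predictions_per_model):
    """Combine several models' label lists into one by majority vote.

    predictions_per_model: list of equal-length label lists, one per model.
    Ties go to the label that appears first among the models' votes.
    """
    ensembled = []
    for votes in zip(*predictions_per_model):
        ensembled.append(Counter(votes).most_common(1)[0][0])
    return ensembled


def macro_f1(gold, predicted):
    """Unweighted mean of the per-class F1 scores (F1-macro), in [0, 1]."""
    labels = sorted(set(gold) | set(predicted))
    scores = []
    for label in labels:
        pairs = list(zip(gold, predicted))
        tp = sum(1 for g, p in pairs if g == label and p == label)
        fp = sum(1 for g, p in pairs if g != label and p == label)
        fn = sum(1 for g, p in pairs if g == label and p != label)
        denom = 2 * tp + fp + fn
        # A class with no true or predicted instances contributes 0.0.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy data (hypothetical): 1 = protest-related document, 0 = not.
gold = [1, 0, 1, 1, 0, 0]
model_a = [1, 0, 0, 1, 0, 1]
model_b = [1, 1, 1, 1, 0, 0]
model_c = [0, 0, 1, 1, 0, 0]

ensemble = majority_vote([model_a, model_b, model_c])
print("model_a F1-macro:", round(macro_f1(gold, model_a), 4))
print("ensemble F1-macro:", round(macro_f1(gold, ensemble), 4))
```

In this toy case each model makes different mistakes, so the vote cancels them out. Real systems may combine models differently, for example by averaging predicted probabilities.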
Abstract
We report results of the CASE 2022 Shared Task 1 on Multilingual Protest Event Detection. This task is a continuation of CASE 2021 that consists of four subtasks that are i) document classification, ii) sentence classification, iii) event sentence coreference identification, and iv) event extraction. The CASE 2022 extension consists of expanding the test data with more data in previously available languages, namely, English, Hindi, Portuguese, and Spanish, and adding new test data in Mandarin, Turkish, and Urdu for Sub-task 1, document classification. The training data from CASE 2021 in English, Portuguese and Spanish were utilized. Therefore, predicting document labels in Hindi, Mandarin, Turkish, and Urdu occurs in a zero-shot setting. The CASE 2022 workshop accepts reports on systems developed for predicting test data of CASE 2021 as well. We observe that the best systems submitted…
Taxonomy
Topics: Natural Language Processing Techniques · Topic Modeling · Hate Speech and Cyberbullying Detection
Methods: Test
