Cultural Compass: Predicting Transfer Learning Success in Offensive Language Detection with Cultural Features
Li Zhou, Antonia Karamolegkou, Wenyu Chen, Daniel Hershcovich

TL;DR
This study demonstrates that cultural features, especially cultural value surveys and offensive word distance, can predict the success of transfer learning in offensive language detection, promoting culturally sensitive NLP models.
Contribution
It introduces the use of cultural features to predict transfer learning success in offensive language detection, highlighting the importance of cultural data integration.
Findings
Cultural value surveys predict transfer learning success in OLD.
Offensive word distance further improves prediction accuracy.
Incorporating cultural information enhances model cultural adaptability.
Abstract
The increasing ubiquity of language technology necessitates a shift towards considering cultural diversity in the machine learning realm, particularly for subjective tasks that rely heavily on cultural nuances, such as Offensive Language Detection (OLD). Current understanding underscores that these tasks are substantially influenced by cultural values, however, a notable gap exists in determining if cultural features can accurately predict the success of cross-cultural transfer learning for such subjective tasks. Addressing this, our study delves into the intersection of cultural features and transfer learning effectiveness. The findings reveal that cultural value surveys indeed possess a predictive power for cross-cultural transfer learning success in OLD tasks and that it can be further improved using offensive word distance. Based on these results, we advocate for the integration of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Interpreting and Communication in Healthcare
