CoLLAB: A Collaborative Approach for Multilingual Abuse Detection

Orchid Chetia Phukan; Yashasvi Chaurasia; Arun Balaji Buduru; Rajesh; Sharma

arXiv:2406.03205·eess.AS·June 6, 2024

CoLLAB: A Collaborative Approach for Multilingual Abuse Detection

Orchid Chetia Phukan, Yashasvi Chaurasia, Arun Balaji Buduru, Rajesh, Sharma

PDF

Open Access

TL;DR

This paper introduces CoLLAB, a novel framework for multilingual audio abuse detection that merges models across languages without retraining, improving scalability and performance in diverse linguistic environments.

Contribution

The paper proposes CoLLAB, a new model merging approach that enables multilingual abuse detection without additional training, addressing scalability and resource challenges.

Findings

01

PTM representations outperform others in AAD

02

Combining PTM representations improves accuracy

03

CoLLAB achieves competitive multilingual AAD performance

Abstract

In this study, we investigate representations from paralingual Pre-Trained model (PTM) for Audio Abuse Detection (AAD), which has not been explored for AAD. Our results demonstrate their superiority compared to other PTM representations on the ADIMA benchmark. Furthermore, combining PTM representations enhances AAD performance. Despite these improvements, challenges with cross-lingual generalizability still remain, and certain languages require training in the same language. This demands individual models for different languages, leading to scalability, maintenance, and resource allocation issues and hindering the practical deployment of AAD systems in linguistically diverse real-world environments. To address this, we introduce CoLLAB, a novel framework that doesn't require training and allows seamless merging of models trained in different languages through weight-averaging. This…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection