Soft-Label Integration for Robust Toxicity Classification

Zelei Cheng; Xian Wu; Jiahao Yu; Shuo Han; Xin-Qiang Cai; Xinyu Xing

arXiv:2410.14894·cs.AI·November 11, 2024

Soft-Label Integration for Robust Toxicity Classification

Zelei Cheng, Xian Wu, Jiahao Yu, Shuo Han, Xin-Qiang Cai, Xinyu Xing

PDF

Open Access 1 Repo

TL;DR

This paper presents a novel bi-level optimization framework that combines crowdsourced soft-labels with GroupDRO to improve the robustness and accuracy of toxicity classifiers, especially against out-of-distribution data.

Contribution

It introduces a new bi-level optimization method integrating crowdsourced annotations with soft-labeling and GroupDRO for robust toxicity classification.

Findings

01

Outperforms baseline methods in accuracy metrics

02

Enhances robustness against out-of-distribution data

03

Theoretically proven convergence of the optimization algorithm

Abstract

Toxicity classification in textual content remains a significant problem. Data with labels from a single annotator fall short of capturing the diversity of human perspectives. Therefore, there is a growing need to incorporate crowdsourced annotations for training an effective toxicity classifier. Additionally, the standard approach to training a classifier using empirical risk minimization (ERM) may fail to address the potential shifts between the training set and testing set due to exploiting spurious correlations. This work introduces a novel bi-level optimization framework that integrates crowdsourced annotations with the soft-labeling technique and optimizes the soft-label weights by Group Distributionally Robust Optimization (GroupDRO) to enhance the robustness against out-of-distribution (OOD) risk. We theoretically prove the convergence of our bi-level optimization algorithm.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chengzelei/crowdsource_toxicity_classification
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnalytical Methods in Pharmaceuticals

MethodsSparse Evolutionary Training