On the definition of toxicity in NLP

Sergey Berezin; Reza Farahbakhsh; Noel Crespi

arXiv:2310.02357 · cs.CL · October 23, 2023 · 1 citation

Open Access

TL;DR

This paper proposes a new, objective, stress-level-based definition of toxicity in NLP to improve the robustness and accuracy of toxicity detection models, addressing the problem of vague and subjective existing definitions.

Contribution

It introduces a novel, stress-level-based toxicity definition and discusses its application in dataset creation and model training for NLP toxicity detection.

Findings

1. Proposes an objective, context-aware toxicity definition
2. Suggests methods for dataset creation based on stress levels
3. Aims to improve model robustness and accuracy

Abstract

The fundamental problem in the toxicity detection task is that toxicity is ill-defined. As a result, models are trained on subjective and vague data, which yields non-robust and inaccurate results: garbage in, garbage out. This work suggests a new, stress-level-based definition of toxicity designed to be objective and context-aware. Alongside it, we also describe possible ways of applying this new definition to dataset creation and model training.

Taxonomy

Topics: Software Engineering Research · Machine Learning and Data Classification · Computational Drug Discovery Methods

Methods: Jigsaw