DeMod: A Holistic Tool with Explainable Detection and Personalized Modification for Toxicity Censorship

Yaqiong Li; Peng Zhang; Hansu Gu; Tun Lu; Siyuan Qiao; Yubo Shu; Yiyang Shao; Ning Gu

arXiv:2411.01844·cs.HC·May 23, 2025

DeMod: A Holistic Tool with Explainable Detection and Personalized Modification for Toxicity Censorship

Yaqiong Li, Peng Zhang, Hansu Gu, Tun Lu, Siyuan Qiao, Yubo Shu, Yiyang Shao, Ning Gu

PDF

Open Access

TL;DR

DeMod is a comprehensive, user-friendly tool that combines explainable toxicity detection with personalized content modification, addressing complex censorship needs beyond mere detection.

Contribution

This paper introduces DeMod, a novel ChatGPT-based system that integrates explainable detection and personalized content modification for toxicity censorship.

Findings

01

DeMod achieves high accuracy in toxicity detection.

02

Users find DeMod's explanations and modifications helpful.

03

The tool demonstrates ease of use and functional richness.

Abstract

Although there have been automated approaches and tools supporting toxicity censorship for social posts, most of them focus on detection. Toxicity censorship is a complex process, wherein detection is just an initial task and a user can have further needs such as rationale understanding and content modification. For this problem, we conduct a needfinding study to investigate people's diverse needs in toxicity censorship and then build a ChatGPT-based censorship tool named DeMod accordingly. DeMod is equipped with the features of explainable Detection and personalized Modification, providing fine-grained detection results, detailed explanations, and personalized modification suggestions. We also implemented the tool and recruited 35 Weibo users for evaluation. The results suggest DeMod's multiple strengths like the richness of functionality, the accuracy of censorship, and ease of use.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning

MethodsFocus