Pragmatic Inference Chain (PIC) Improving LLMs' Reasoning of Authentic Implicit Toxic Language
Xi Chen, Shuo Wang

TL;DR
This paper introduces Pragmatic Inference Chain (PIC), a novel prompting method that enhances large language models' ability to detect complex, implicit toxic language by improving their reasoning and inference capabilities.
Contribution
The study proposes the PIC prompting technique, inspired by cognitive science and linguistics, significantly improving LLMs' performance on authentic implicit toxic language detection tasks.
Findings
PIC improves GPT-4o and other LLMs' success rates in identifying implicit toxic language.
Models produce more explicit and coherent reasoning processes with PIC.
Potential to generalize PIC to other inference-intensive tasks like humor and metaphor understanding.
Abstract
The rapid development of large language models (LLMs) gives rise to ethical concerns about their performance, while opening new avenues for developing toxic language detection techniques. However, LLMs' unethical output and their capability of detecting toxicity have primarily been tested on language data that do not demand complex meaning inference, such as the biased associations of 'he' with programmer and 'she' with household. Nowadays toxic language adopts a much more creative range of implicit forms, thanks to advanced censorship. In this study, we collect authentic toxic interactions that evade online censorship and that are verified by human annotators as inference-intensive. To evaluate and improve LLMs' reasoning of the authentic implicit toxic language, we propose a new prompting method, Pragmatic Inference Chain (PIC), drawn on interdisciplinary findings from cognitive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques
