TalkDown: A Corpus for Condescension Detection in Context

Zijian Wang; Christopher Potts

arXiv:1909.11272·cs.CL·September 26, 2019

TalkDown: A Corpus for Condescension Detection in Context

Zijian Wang, Christopher Potts

PDF

Open Access 1 Repo

TL;DR

This paper introduces TalkDown, a new dataset for detecting condescension in context, and demonstrates that incorporating discourse information improves detection accuracy, enabling analysis of community norms across online platforms.

Contribution

The paper presents TalkDown, the first labeled dataset for condescension detection in context, and proposes models that leverage discourse representations to improve detection performance.

Findings

01

Discourse-aware models outperform language-only models in condescension detection.

02

Condescension rates vary significantly across online communities.

03

Techniques for handling low condescension prevalence improve detection robustness.

Abstract

Condescending language use is caustic; it can bring dialogues to an end and bifurcate communities. Thus, systems for condescension detection could have a large positive impact. A challenge here is that condescension is often impossible to detect from isolated utterances, as it depends on the discourse and social context. To address this, we present TalkDown, a new labeled dataset of condescending linguistic acts in context. We show that extending a language-only model with representations of the discourse improves performance, and we motivate techniques for dealing with the low rates of condescension overall. We also use our model to estimate condescension rates in various online communities and relate these differences to differing community norms.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zijwang/talkdown
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Discourse Analysis in Language Studies