Neural Contrastive Clustering: Fully Unsupervised Bias Reduction for   Sentiment Classification

Jared Mowery

arXiv:2204.10467·cs.CL·April 25, 2022

Neural Contrastive Clustering: Fully Unsupervised Bias Reduction for Sentiment Classification

Jared Mowery

PDF

Open Access

TL;DR

This paper introduces a fully unsupervised neural contrastive clustering method that reduces correlation bias in sentiment classification, especially on controversial topics like COVID-19 social media data, without needing labeled data.

Contribution

It presents a novel unsupervised adversarial learning approach that effectively mitigates correlation bias in neural network sentiment classifiers, outperforming some supervised methods.

Findings

01

Approximately doubles accuracy on bias-prone sentences

02

Maintains overall F1 score of the classifier

03

Outperforms supervised masking approach in bias reduction

Abstract

Background: Neural networks produce biased classification results due to correlation bias (they learn correlations between their inputs and outputs to classify samples, even when those correlations do not represent cause-and-effect relationships). Objective: This study introduces a fully unsupervised method of mitigating correlation bias, demonstrated with sentiment classification on COVID-19 social media data. Methods: Correlation bias in sentiment classification often arises in conversations about controversial topics. Therefore, this study uses adversarial learning to contrast clusters based on sentiment classification labels, with clusters produced by unsupervised topic modeling. This discourages the neural network from learning topic-related features that produce biased classification results. Results: Compared to a baseline classifier, neural contrastive clustering…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMisinformation and Its Impacts · Sentiment Analysis and Opinion Mining · Hate Speech and Cyberbullying Detection