Neural Contrastive Clustering: Fully Unsupervised Bias Reduction for Sentiment Classification
Jared Mowery

TL;DR
This paper introduces a fully unsupervised neural contrastive clustering method that reduces correlation bias in sentiment classification, especially on controversial topics like COVID-19 social media data, without needing labeled data.
Contribution
It presents a novel unsupervised adversarial learning approach that effectively mitigates correlation bias in neural network sentiment classifiers, outperforming some supervised methods.
Findings
Approximately doubles accuracy on bias-prone sentences
Maintains overall F1 score of the classifier
Outperforms supervised masking approach in bias reduction
Abstract
Background: Neural networks produce biased classification results due to correlation bias (they learn correlations between their inputs and outputs to classify samples, even when those correlations do not represent cause-and-effect relationships). Objective: This study introduces a fully unsupervised method of mitigating correlation bias, demonstrated with sentiment classification on COVID-19 social media data. Methods: Correlation bias in sentiment classification often arises in conversations about controversial topics. Therefore, this study uses adversarial learning to contrast clusters based on sentiment classification labels, with clusters produced by unsupervised topic modeling. This discourages the neural network from learning topic-related features that produce biased classification results. Results: Compared to a baseline classifier, neural contrastive clustering…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Sentiment Analysis and Opinion Mining · Hate Speech and Cyberbullying Detection
