Measuring Spurious Correlation in Classification: 'Clever Hans' in Translationese
Angana Borah, Daria Pylypenko, Cristina Espana-Bonet, Josef van, Genabith

TL;DR
This paper investigates how neural classifiers for translationese may rely on spurious topic correlations rather than genuine signals, proposing measures and methods to quantify and mitigate such biases.
Contribution
The paper introduces a new measure for detecting spurious topic correlations and explores masking techniques to reduce their impact in translationese classification.
Findings
The measure aligns with clustering purity, indicating spurious correlation levels.
Masking known spurious topic signals can improve classifier reliability.
Proposed methods help distinguish genuine translationese signals from spurious correlations.
Abstract
Recent work has shown evidence of 'Clever Hans' behavior in high-performance neural translationese classifiers, where BERT-based classifiers capitalize on spurious correlations, in particular topic information, between data and target classification labels, rather than genuine translationese signals. Translationese signals are subtle (especially for professional translation) and compete with many other signals in the data such as genre, style, author, and, in particular, topic. This raises the general question of how much of the performance of a classifier is really due to spurious correlations in the data versus the signals actually targeted for by the classifier, especially for subtle target signals and in challenging (low resource) data settings. We focus on topic-based spurious correlation and approach the question from two directions: (i) where we have no knowledge about spurious…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Computational Physics and Python Applications · Topic Modeling
MethodsFocus
