Square One Bias in NLP: Towards a Multi-Dimensional Exploration of the Research Manifold
Sebastian Ruder, Ivan Vuli\'c, Anders S{\o}gaard

TL;DR
This paper highlights the prevalent focus on single dimensions like accuracy or fairness in NLP research, demonstrating that multi-dimensional exploration is limited and proposing ways to broaden research scope.
Contribution
It introduces the concept of the square one bias in NLP, provides a classification of recent research along multiple dimensions, and offers practical recommendations to promote multi-dimensional exploration.
Findings
Most NLP research focuses on a single dimension such as accuracy or fairness.
Research often considers only one language or one dimension, limiting exploration.
The paper provides open-source annotations for further analysis.
Abstract
The prototypical NLP experiment trains a standard architecture on labeled English data and optimizes for accuracy, without accounting for other dimensions such as fairness, interpretability, or computational efficiency. We show through a manual classification of recent NLP research papers that this is indeed the case and refer to it as the square one experimental setup. We observe that NLP research often goes beyond the square one setup, e.g, focusing not only on accuracy, but also on fairness or interpretability, but typically only along a single dimension. Most work targeting multilinguality, for example, considers only accuracy; most work on fairness or interpretability considers only English; and so on. We show this through manual classification of recent NLP research papers and ACL Test-of-Time award recipients. Such one-dimensionality of most research means we are only exploring a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Interpreting and Communication in Healthcare
