Square One Bias in NLP: Towards a Multi-Dimensional Exploration of the   Research Manifold

Sebastian Ruder; Ivan Vuli\'c; Anders S{\o}gaard

arXiv:2206.09755·cs.CL·June 22, 2022·1 cites

Square One Bias in NLP: Towards a Multi-Dimensional Exploration of the Research Manifold

Sebastian Ruder, Ivan Vuli\'c, Anders S{\o}gaard

PDF

Open Access 1 Repo

TL;DR

This paper highlights the prevalent focus on single dimensions like accuracy or fairness in NLP research, demonstrating that multi-dimensional exploration is limited and proposing ways to broaden research scope.

Contribution

It introduces the concept of the square one bias in NLP, provides a classification of recent research along multiple dimensions, and offers practical recommendations to promote multi-dimensional exploration.

Findings

01

Most NLP research focuses on a single dimension such as accuracy or fairness.

02

Research often considers only one language or one dimension, limiting exploration.

03

The paper provides open-source annotations for further analysis.

Abstract

The prototypical NLP experiment trains a standard architecture on labeled English data and optimizes for accuracy, without accounting for other dimensions such as fairness, interpretability, or computational efficiency. We show through a manual classification of recent NLP research papers that this is indeed the case and refer to it as the square one experimental setup. We observe that NLP research often goes beyond the square one setup, e.g, focusing not only on accuracy, but also on fairness or interpretability, but typically only along a single dimension. Most work targeting multilinguality, for example, considers only accuracy; most work on fairness or interpretability considers only English; and so on. We show this through manual classification of recent NLP research papers and ACL Test-of-Time award recipients. Such one-dimensionality of most research means we are only exploring a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

google-research/url-nlp
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Interpreting and Communication in Healthcare