Troubling Trends in Machine Learning Scholarship

Zachary C. Lipton; Jacob Steinhardt

arXiv:1807.03341·stat.ML·July 27, 2018

Troubling Trends in Machine Learning Scholarship

Zachary C. Lipton, Jacob Steinhardt

PDF

TL;DR

This paper critically examines concerning trends in machine learning research, highlighting issues like confusing explanation with speculation, misattributing empirical gains, and misuse of mathematics and language, and proposes community remedies.

Contribution

It identifies and analyzes four problematic patterns in ML scholarship and suggests potential community-wide solutions to improve clarity and rigor.

Findings

01

Four problematic patterns identified: explanation vs. speculation, source of gains, mathiness, language misuse.

02

Community expansion and reviewer pool issues contribute to these trends.

03

Proposed remedies aim to enhance research clarity and integrity.

Abstract

Collectively, machine learning (ML) researchers are engaged in the creation and dissemination of knowledge about data-driven algorithms. In a given paper, researchers might aspire to any subset of the following goals, among others: to theoretically characterize what is learnable, to obtain understanding through empirically rigorous experiments, or to build a working system that has high predictive accuracy. While determining which knowledge warrants inquiry may be subjective, once the topic is fixed, papers are most valuable to the community when they act in service of the reader, creating foundational knowledge and communicating as clearly as possible. Recent progress in machine learning comes despite frequent departures from these ideals. In this paper, we focus on the following four patterns that appear to us to be trending in ML scholarship: (i) failure to distinguish between…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.