The Elephant in the Room: Analyzing the Presence of Big Tech in Natural Language Processing Research
Mohamed Abdalla, Jan Philip Wahle, Terry Ruas, Aur\'elie, N\'ev\'eol, Fanny Ducel, Saif M. Mohammad, Kar\"en Fort

TL;DR
This paper analyzes the growing influence of industry in NLP research, highlighting a significant increase in industry-authored publications and funding, emphasizing the need for transparency in the field.
Contribution
It provides a comprehensive quantitative analysis of industry presence in NLP research over three decades, revealing rapid growth and dominant industry players.
Findings
Industry presence in NLP has increased by 180% from 2017 to 2022.
A few companies dominate industry-authored publications and funding.
The study underscores the importance of transparency regarding industry influence.
Abstract
Recent advances in deep learning methods for natural language processing (NLP) have created new business opportunities and made NLP research critical for industry development. As one of the big players in the field of NLP, together with governments and universities, it is important to track the influence of industry on research. In this study, we seek to quantify and characterize industry presence in the NLP community over time. Using a corpus with comprehensive metadata of 78,187 NLP publications and 701 resumes of NLP publication authors, we explore the industry presence in the field since the early 90s. We find that industry presence among NLP authors has been steady before a steep increase over the past five years (180% growth from 2017 to 2022). A few companies account for most of the publications and provide funding to academic researchers through grants and internships. Our study…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOpen Source Software Innovations
