Do Large Language Models (Really) Need Statistical Foundations?
Weijie Su

TL;DR
This paper argues that statistical foundations are crucial for understanding and improving large language models, emphasizing their inherent statistical nature and the need for statistical approaches due to their complexity and black-box characteristics.
Contribution
It presents two main arguments for the importance of statistical methods in LLMs and outlines key research areas where statistics can contribute significantly.
Findings
LLMs are inherently statistical models due to data dependency and stochastic processes.
Statistical approaches are essential for handling LLMs' complexity and black-box nature.
Statistical research will form a diverse mosaic of topics in LLM development.
Abstract
Large language models (LLMs) represent a new paradigm for processing unstructured data, with applications across an unprecedented range of domains. In this paper, we address, through two arguments, whether the development and application of LLMs would genuinely benefit from foundational contributions from the statistics discipline. First, we argue affirmatively, beginning with the observation that LLMs are inherently statistical models due to their profound data dependency and stochastic generation processes, where statistical insights are naturally essential for handling variability and uncertainty. Second, we argue that the persistent black-box nature of LLMs -- stemming from their immense scale, architectural complexity, and development practices often prioritizing empirical performance over theoretical interpretability -- renders closed-form or purely mechanistic analyses generally…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
