Large Language Models and Scientific Discourse: Where's the Intelligence?
Harry Collins, Simon Thorne

TL;DR
This paper compares how humans and large language models (LLMs) build scientific knowledge, highlighting that LLMs lack access to social and tacit knowledge crucial for early scientific discovery, emphasizing human intelligence over LLMs.
Contribution
It introduces a framework contrasting human and LLM knowledge formation, emphasizing the limitations of LLMs in early scientific discourse and proposing future alignment mechanisms.
Findings
LLMs cannot access social discourse crucial for early scientific knowledge.
LLMs' understanding depends on written literature, insecure in initial knowledge stages.
Human intelligence remains central, with LLMs reflecting changes in human discourse over time.
Abstract
We explore the capabilities of Large Language Models (LLMs) by comparing the way they gather data with the way humans build knowledge. Here we examine how scientific knowledge is made and compare it with LLMs. The argument is structured by reference to two figures, one representing scientific knowledge and the other LLMs. In a 2014 study, scientists explain how they choose to ignore a 'fringe science' paper in the domain of gravitational wave physics: the decisions are made largely as a result of tacit knowledge built up in social discourse, most spoken discourse, within closed groups of experts. It is argued that LLMs cannot or do not currently access such discourse, but it is typical of the early formation of scientific knowledge. LLMs 'understanding' builds on written literatures and is therefore insecure in the case of the initial stages of knowledge building. We refer to Colin…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Artificial Intelligence in Healthcare and Education · Language and cultural evolution
