Large Language Models and Scientific Discourse: Where's the Intelligence?

Harry Collins; Simon Thorne

arXiv:2603.23543·physics.soc-ph·March 26, 2026

Large Language Models and Scientific Discourse: Where's the Intelligence?

Harry Collins, Simon Thorne

PDF

Open Access

TL;DR

This paper compares how humans and large language models (LLMs) build scientific knowledge, highlighting that LLMs lack access to social and tacit knowledge crucial for early scientific discovery, emphasizing human intelligence over LLMs.

Contribution

It introduces a framework contrasting human and LLM knowledge formation, emphasizing the limitations of LLMs in early scientific discourse and proposing future alignment mechanisms.

Findings

01

LLMs cannot access social discourse crucial for early scientific knowledge.

02

LLMs' understanding depends on written literature, insecure in initial knowledge stages.

03

Human intelligence remains central, with LLMs reflecting changes in human discourse over time.

Abstract

We explore the capabilities of Large Language Models (LLMs) by comparing the way they gather data with the way humans build knowledge. Here we examine how scientific knowledge is made and compare it with LLMs. The argument is structured by reference to two figures, one representing scientific knowledge and the other LLMs. In a 2014 study, scientists explain how they choose to ignore a 'fringe science' paper in the domain of gravitational wave physics: the decisions are made largely as a result of tacit knowledge built up in social discourse, most spoken discourse, within closed groups of experts. It is argued that LLMs cannot or do not currently access such discourse, but it is typical of the early formation of scientific knowledge. LLMs 'understanding' builds on written literatures and is therefore insecure in the case of the initial stages of knowledge building. We refer to Colin…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational and Text Analysis Methods · Artificial Intelligence in Healthcare and Education · Language and cultural evolution