Large language models eroding science understanding: an experimental study

Harry Collins; Hartmut Grote; Paul Newbury; Patrick Sutton; Simon Thorne

arXiv:2604.25639·cs.CY·April 29, 2026

Large language models eroding science understanding: an experimental study

Harry Collins, Hartmut Grote, Paul Newbury, Patrick Sutton, Simon Thorne

PDF

TL;DR

This study investigates the susceptibility of large language models to manipulation, revealing their vulnerability to fringe scientific influences and the risks they pose to accurate scientific communication.

Contribution

It demonstrates how LLMs can be deliberately manipulated to produce misleading scientific answers, exposing vulnerabilities in their reliability.

Findings

01

Altered LLMs produced convincing but false answers.

02

Manipulation can influence LLM responses on scientific topics.

03

LLMs are vulnerable to fringe scientific material.

Abstract

This paper is under review in AI and Ethics This study examines whether large language models (LLMs) can reliably answer scientific questions and demonstrates how easily they can be influenced by fringe scientific material. The authors modified custom LLMs to prioritise knowledge in selected fringe papers on the Fine Structure Constant and Gravitational Waves, then compared their responses with those of domain experts and standard LLMs. The altered models produced fluent, convincing answers that contradicted scientific consensus and were difficult for non-experts to detect as misleading. The results show that LLMs are vulnerable to manipulation and cannot replace expert judgment, highlighting risks for public understanding of science and the potential spread of misinformation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.