How Open Must Language Models be to Enable Reliable Scientific Inference?

James A. Michaelov; Catherine Arnett; Tyler A. Chang; Pamela D. Rivi\`ere; Samuel M. Taylor; Cameron R. Jones; Sean Trott; Roger P. Levy; Benjamin K. Bergen; Micah Altman

arXiv:2603.26539·cs.CL·May 21, 2026

How Open Must Language Models be to Enable Reliable Scientific Inference?

James A. Michaelov, Catherine Arnett, Tyler A. Chang, Pamela D. Rivi\`ere, Samuel M. Taylor, Cameron R. Jones, Sean Trott, Roger P. Levy, Benjamin K. Bergen, Micah Altman

PDF

TL;DR

This paper examines how the openness of language models affects the reliability of scientific inference, highlighting issues with closed models and proposing guidelines for responsible model use in research.

Contribution

It analyzes the impact of model openness on scientific inference and offers recommendations for mitigating related risks in research practices.

Findings

01

Closed models pose threats to reliable inference

02

Open models can improve scientific research validity

03

Guidelines for responsible model use are proposed

Abstract

How does the extent to which a model is open or closed impact the scientific inferences that can be drawn from research that involves it? In this paper, we analyze how restrictions on information about model construction and deployment threaten reliable inference. We argue that current closed models are generally ill-suited for scientific purposes, with some notable exceptions, and discuss ways in which the issues they present to reliable inference can be resolved or mitigated. We recommend that when models are used in research, potential threats to inference should be systematically identified along with the steps taken to mitigate them, and that specific justifications for model selection should be provided.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.