Language Models Should be Used to Surface the Unwritten Code of Science and Society
Honglin Bao, Siyang Wu, Jiwoong Choi, Yingrong Mao, James A. Evans

TL;DR
This paper proposes using large language models to uncover and critique the implicit biases and unwritten rules in science and society by analyzing their self-generated hypotheses and comparing them with human judgments.
Contribution
It introduces a novel framework that leverages LLMs to surface tacit societal and scientific heuristics, demonstrated through a case study on peer review analysis.
Findings
LLMs' normative priors shift towards emphasizing external connections in scientific evaluation.
Human reviewers explicitly reward certain aspects but avoid articulating contextualization, despite implicit rewards.
The framework is robust across models and can be applied broadly to surface tacit societal codes.
Abstract
This paper calls on the research community not only to investigate how human biases are inherited by large language models (LLMs) but also to explore how these biases in LLMs can be leveraged to make society's "unwritten code" - such as implicit stereotypes and heuristics - visible and accessible for critique. We introduce a conceptual framework through a case study in science: uncovering hidden rules in peer review - the factors that reviewers care about but rarely state explicitly due to normative scientific expectations. The idea of the framework is to push LLMs to speak out their heuristics through generating self-consistent hypotheses - why one paper appeared stronger in reviewer scoring - among paired papers submitted to 46 academic conferences, while iteratively searching deeper hypotheses from remaining pairs where existing hypotheses cannot explain. We observed that LLMs'…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Artificial Intelligence in Healthcare and Education · Climate Change Communication and Perception
MethodsALIGN
