Mitigating Interpretation Bias in Rock Records with Large Language   Models: Insights from Paleoenvironmental Analysis

Luoqi Wang; Haipeng Li; Linshu Hu; Jiarui Cai; and Zhenhong Du

arXiv:2407.09977·physics.geo-ph·July 16, 2024

Mitigating Interpretation Bias in Rock Records with Large Language Models: Insights from Paleoenvironmental Analysis

Luoqi Wang, Haipeng Li, Linshu Hu, Jiarui Cai, and Zhenhong Du

PDF

Open Access

TL;DR

This paper presents a novel approach using Large Language Models with retrieval augmented generation to reduce interpretation bias in paleoenvironmental analysis, improving the accuracy of geological reconstructions.

Contribution

It introduces a systematic framework leveraging LLMs to generate and evaluate multiple hypotheses, mitigating human bias in rock record interpretation.

Findings

01

LLMs effectively generate multiple geological hypotheses.

02

The approach reduces interpretation bias in sedimentology and paleogeography.

03

Enhanced accuracy in paleoenvironmental reconstructions.

Abstract

The reconstruction of Earth's history faces significant challenges due to the nonunique interpretations often derived from rock records. The problem has long been recognized but there are no systematic solutions in practice. This study introduces an innovative approach that leverages Large Language Models (LLMs) along with retrieval augmented generation and real-time search capabilities to counteract interpretation biases, thereby enhancing the accuracy and reliability of geological analyses. By applying this framework to sedimentology and paleogeography, we demonstrate its effectiveness in mitigating interpretations biases through the generation and evaluation of multiple hypotheses for the same data, which can effectively reduce human bias. Our research illuminates the transformative potential of LLMs in refining paleoenvironmental studies and extends their applicability across…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling