Hallucination Detection and Mitigation in Scientific Text Simplification using Ensemble Approaches: DS@GT at CLEF 2025 SimpleText

Krishna Chaitanya Marturi; Heba H. Elwazzan

arXiv:2508.11823·cs.CL·August 19, 2025

Hallucination Detection and Mitigation in Scientific Text Simplification using Ensemble Approaches: DS@GT at CLEF 2025 SimpleText

Krishna Chaitanya Marturi, Heba H. Elwazzan

PDF

Open Access

TL;DR

This paper presents an ensemble approach combining multiple models to detect and mitigate hallucinations and distortions in scientific text simplification, improving the reliability of generated simplified texts.

Contribution

It introduces a novel ensemble framework integrating BERT, semantic similarity, NLI, and LLM reasoning for hallucination detection in scientific text simplification.

Findings

01

Enhanced detection accuracy through ensemble methods

02

Effective post-editing with LLM-based revisions

03

Improved robustness against information distortion

Abstract

In this paper, we describe our methodology for the CLEF 2025 SimpleText Task 2, which focuses on detecting and evaluating creative generation and information distortion in scientific text simplification. Our solution integrates multiple strategies: we construct an ensemble framework that leverages BERT-based classifier, semantic similarity measure, natural language inference model, and large language model (LLM) reasoning. These diverse signals are combined using meta-classifiers to enhance the robustness of spurious and distortion detection. Additionally, for grounded generation, we employ an LLM-based post-editing system that revises simplifications based on the original input texts.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBiomedical Text Mining and Ontologies · COVID-19 diagnosis using AI · Text Readability and Simplification