Small Encoders Can Rival Large Decoders in Detecting Groundedness

Istabrak Abbes; Gabriele Prato; Quentin Fournier; Fernando Rodriguez; Alaa Boukhary; Adam Elwood; Sarath Chandar

arXiv:2506.21288·cs.CL·June 27, 2025

Small Encoders Can Rival Large Decoders in Detecting Groundedness

Istabrak Abbes, Gabriele Prato, Quentin Fournier, Fernando Rodriguez, Alaa Boukhary, Adam Elwood, Sarath Chandar

PDF

Open Access 1 Video

TL;DR

This paper demonstrates that small, task-specific encoder models like RoBERTa and NomicBERT can effectively detect whether responses are grounded in context, rivaling larger LLMs while significantly reducing inference time.

Contribution

The study shows that lightweight encoders fine-tuned for groundedness detection can match large LLMs' accuracy, offering a more efficient solution.

Findings

01

Encoder models achieve comparable accuracy to large LLMs.

02

Groundedness detection reduces inference latency significantly.

03

Fine-tuned encoders are effective for factual consistency tasks.

Abstract

Augmenting large language models (LLMs) with external context significantly improves their performance in natural language processing (NLP) tasks. However, LLMs struggle to answer queries reliably when the provided context lacks information, often resorting to ungrounded speculation or internal knowledge. Groundedness - generating responses strictly supported by the context - is essential for ensuring factual consistency and trustworthiness. This study focuses on detecting whether a given query is grounded in a document provided in context before the costly answer generation by LLMs. Such a detection mechanism can significantly reduce both inference time and resource consumption. We show that lightweight, task specific encoder models such as RoBERTa and NomicBERT, fine-tuned on curated datasets, can achieve accuracy comparable to state-of-the-art LLMs, such as Llama3 8B and GPT4o, in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Small Encoders Can Rival Large Decoders in Detecting Groundedness· underline

Taxonomy

TopicsTopic Modeling