Data-efficient Meta-models for Evaluation of Context-based Questions and Answers in LLMs

Julia Belikova; Konstantin Polev; Rauf Parchiev; Dmitry Simakov

arXiv:2505.23299·cs.CL·May 30, 2025

Data-efficient Meta-models for Evaluation of Context-based Questions and Answers in LLMs

Julia Belikova, Konstantin Polev, Rauf Parchiev, Dmitry Simakov

PDF

Open Access

TL;DR

This paper explores data-efficient methods for evaluating hallucinations in large language models, demonstrating that high performance can be achieved with significantly fewer annotated samples using optimized classification and dimensionality reduction techniques.

Contribution

It introduces a novel approach combining efficient classifiers and dimensionality reduction to reduce data annotation needs for hallucination detection in LLMs.

Findings

01

Achieves comparable performance with only 250 training samples

02

Reduces reliance on extensive annotated datasets

03

Demonstrates effectiveness on standardized QA benchmarks

Abstract

Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) systems are increasingly deployed in industry applications, yet their reliability remains hampered by challenges in detecting hallucinations. While supervised state-of-the-art (SOTA) methods that leverage LLM hidden states -- such as activation tracing and representation analysis -- show promise, their dependence on extensively annotated datasets limits scalability in real-world applications. This paper addresses the critical bottleneck of data annotation by investigating the feasibility of reducing training data requirements for two SOTA hallucination detection frameworks: Lookback Lens, which analyzes attention head dynamics, and probing-based approaches, which decode internal model representations. We propose a methodology combining efficient classification algorithms with dimensionality reduction techniques to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Advanced Text Analysis Techniques · Multimodal Machine Learning Applications

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Attention Dropout · Softmax · WordPiece · BART · Weight Decay · Multi-Head Attention