Minimizing Human Intervention in Online Classification

William R\'eveillard; Vasileios Saketos; Alexandre Proutiere; Richard Combes

arXiv:2510.23557·stat.ML·May 4, 2026

Minimizing Human Intervention in Online Classification

William R\'eveillard, Vasileios Saketos, Alexandre Proutiere, Richard Combes

PDF

TL;DR

This paper explores methods to reduce human feedback in online classification with large language models, proposing algorithms with theoretical guarantees and validating them on real datasets.

Contribution

It introduces the Conservative Hull-based Classifier and Generalized Hull-based Classifier, providing regret bounds and practical extensions for minimizing expert intervention.

Findings

01

CHC achieves $ ext{O}( ext{log}^d T)$ regret for large $T$ and $d=1$

02

GHC enables more aggressive guessing with a tunable parameter

03

Validated on real-world question-answering datasets with state-of-the-art embeddings

Abstract

Training or fine-tuning large language model (LLM)-based systems often requires costly human feedback, yet there is limited understanding of how to minimize such intervention while maintaining strong error guarantees. We study this problem for LLM-based classification systems in an active learning framework: an agent sequentially labels $d$ -dimensional query embeddings drawn i.i.d. from an unknown distribution by either calling a costly expert or guessing with no feedback, with the goal of minimizing regret relative to an oracle with free expert access. When the horizon $T$ is at least exponential in the embedding dimension $d$ , the geometry of the class regions can be learned. In this regime, we propose the Conservative Hull-based Classifier (CHC), which maintains convex hulls of expert-labeled queries and calls the expert when a query lands outside all known hulls. CHC attains…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.