Minimizing Human Intervention in Online Classification
William R\'eveillard, Vasileios Saketos, Alexandre Proutiere, Richard Combes

TL;DR
This paper explores methods to reduce human feedback in online classification with large language models, proposing algorithms with theoretical guarantees and validating them on real datasets.
Contribution
It introduces the Conservative Hull-based Classifier and Generalized Hull-based Classifier, providing regret bounds and practical extensions for minimizing expert intervention.
Findings
CHC achieves $ ext{O}( ext{log}^d T)$ regret for large $T$ and $d=1$
GHC enables more aggressive guessing with a tunable parameter
Validated on real-world question-answering datasets with state-of-the-art embeddings
Abstract
Training or fine-tuning large language model (LLM)-based systems often requires costly human feedback, yet there is limited understanding of how to minimize such intervention while maintaining strong error guarantees. We study this problem for LLM-based classification systems in an active learning framework: an agent sequentially labels -dimensional query embeddings drawn i.i.d. from an unknown distribution by either calling a costly expert or guessing with no feedback, with the goal of minimizing regret relative to an oracle with free expert access. When the horizon is at least exponential in the embedding dimension , the geometry of the class regions can be learned. In this regime, we propose the Conservative Hull-based Classifier (CHC), which maintains convex hulls of expert-labeled queries and calls the expert when a query lands outside all known hulls. CHC attains…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
