Learning to Partially Defer for Sequences

Sahana Rayan; Ambuj Tewari

arXiv:2502.01459·stat.ME·October 10, 2025

Learning to Partially Defer for Sequences

Sahana Rayan, Ambuj Tewari

PDF

Open Access

TL;DR

This paper introduces a sequence-specific deferment framework in Learning to Defer, allowing models to selectively defer parts of sequence predictions to experts, improving cost-accuracy tradeoffs across various tasks.

Contribution

It proposes a novel sequence-level deferment setting and two post-hoc rejector models for granular deferrals, enhancing existing L2D methods.

Findings

01

Granular deferrals outperform whole deferrals in experiments.

02

Token-level rejectors enable precise deferral of specific outputs.

03

Improved cost-accuracy tradeoffs demonstrated on multiple tasks.

Abstract

In the Learning to Defer (L2D) framework, a prediction model can either make a prediction or defer it to an expert, as determined by a rejector. Current L2D methods train the rejector to decide whether to reject the {\em entire prediction}, which is not desirable when the model predicts long sequences. We present an L2D setting for sequence outputs where the system can defer \textit{specific outputs} of the whole model prediction to an expert in an effort to interleave the expert and machine throughout the prediction. We propose two types of model-based post-hoc rejectors for pre-trained predictors: a token-level rejector, which defers specific token predictions to experts with next token prediction capabilities, and a one-time rejector for experts without such abilities, which defers the remaining sequence from a specific point onward. In the experiments, we also empirically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Algorithms and Data Compression