Optimized Deferral for Imbalanced Settings

Corinna Cortes; Anqi Mao; Mehryar Mohri; Yutao Zhong

arXiv:2604.27723·cs.LG·May 1, 2026

Optimized Deferral for Imbalanced Settings

Corinna Cortes, Anqi Mao, Mehryar Mohri, Yutao Zhong

PDF

TL;DR

This paper introduces MILD, a new deferral algorithm that addresses expert imbalance in learning to defer, improving performance in image classification and LLM routing tasks.

Contribution

It develops margin-based loss functions and algorithms tailored for expert imbalance, advancing the learning to defer methodology.

Findings

01

MILD outperforms existing baselines in image classification tasks.

02

MILD improves LLM routing accuracy in real-world scenarios.

03

The proposed loss functions provide theoretical guarantees for imbalanced settings.

Abstract

Learning algorithms can be significantly improved by routing complex or uncertain inputs to specialized experts, balancing accuracy with computational cost. This approach, known as learning to defer, is essential in domains like natural language generation, medical diagnosis, and computer vision, where an effective deferral can reduce errors at low extra resource consumption. However, the two-stage learning to defer setting, which leverages existing predictors such as a collection of LLMs or other classifiers, often faces challenges due to an expert imbalance problem. This imbalance can lead to suboptimal performance, with deferral algorithms favoring the majority expert. We present a comprehensive study of two-stage learning to defer in expert imbalance settings. We cast the deferral loss optimization as a novel cost-sensitive learning problem over the input-expert domain. We derive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.