Mastering Multiple-Expert Routing: Realizable $H$-Consistency and Strong Guarantees for Learning to Defer
Anqi Mao, Mehryar Mohri, Yutao Zhong

TL;DR
This paper develops new surrogate loss functions with strong theoretical guarantees for learning to defer to multiple experts, addressing open questions about consistency and providing empirical validation.
Contribution
It introduces novel surrogate losses with realizable $H$-consistency and strong guarantees for both single-stage and two-stage learning scenarios involving multiple experts.
Findings
New surrogate losses achieve realizable $H$-consistency.
Theoretical bounds are established for multiple-expert scenarios.
Experimental results demonstrate improved performance over baselines.
Abstract
The problem of learning to defer with multiple experts consists of optimally assigning input instances to experts, balancing the trade-off between their accuracy and computational cost. This is a critical challenge in natural language generation, but also in other fields such as image processing, and medical diagnostics. Recent studies have proposed surrogate loss functions to optimize deferral, but challenges remain in ensuring their consistency properties. This paper introduces novel surrogate loss functions and efficient algorithms with strong theoretical learning guarantees. We address open questions regarding realizable -consistency, -consistency bounds, and Bayes-consistency for both single-stage (jointly learning predictor and deferral function) and two-stage (learning only the deferral function with a fixed expert) learning scenarios. For single-stage deferral, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation · Expert finding and Q&A systems · Distributed Sensor Networks and Detection Algorithms
