Identity-Free Deferral For Unseen Experts
Joshua Strong, Pramit Saha, Yasin Ibrahim, Cheng Ouyang, Alison Noble

TL;DR
This paper introduces Identity-Free Deferral (IFD), a novel architecture that improves AI's ability to defer decisions to unseen experts by enforcing permutation symmetry and building expert competence profiles from limited data.
Contribution
The paper proposes IFD, an architecture that enforces permutation invariance and constructs expert competence profiles from few-shot data, enhancing generalization to unseen experts.
Findings
IFD outperforms existing methods in OOD expert scenarios.
It achieves better generalization with fewer annotations.
Experiments on medical and ImageNet benchmarks validate its effectiveness.
Abstract
Learning to Defer (L2D) improves AI reliability in decision-critical environments by training AI to either make its own prediction or defer the decision to a human expert. A key challenge is adapting to unseen experts at test time, whose competence can differ from the training population. Current methods for this task, however, can falter when unseen experts are out-of-distribution (OOD) relative to the training population. We identify a core architectural flaw as the cause: they learn identity-conditioned policies by processing class-indexed signals in fixed coordinates, creating shortcuts that violate the problem's inherent permutation symmetry. We introduce Identity-Free Deferral (IFD), an architecture that enforces this symmetry by construction. From a few-shot context, IFD builds a query-independent Bayesian competence profile for each expert. It then supplies the deferral rejector…
Peer Reviews
Decision·ICLR 2026 Poster
- The work is sound and I understand the motivation on the symmetry. - Reducing the number of predictions required by the expert is good. - The method is quite practical and scalable.
- I had a lot of trouble following the paper. The presentation could be substantially clarified: the structure feels dense, and it is often unclear how the different components are connected. A significant rewrite or reorganization might be needed to make the flow easier to follow. - Some background and notation are imprecise or undefined. For example, $r(x)$ later becomes $r(x, E)$ without a clear definition, $\Delta$ is introduced but never defined, and LCB appears before Lower Confidence Bo
The paper tackles a highly relevant and practically important problem in L2D: ensuring reliable deferral to unseen or OOD experts. This setting reflects real-world scenarios such as hospital staff turnover, making the problem both timely and significant for AI deployment. In terms of originality, the work offers a clear conceptual advance by identifying identity-conditioned shortcuts as a core limitation of existing L2D approaches and proposing an architectural solution (IFD) that enforces perm
1) Although the theoretical analysis convincingly demonstrates that standard population-adaptive L2D architectures suffer from identity-conditioned shortcuts and that IFD enforces permutation invariance by design, the paper would benefit from a direct empirical illustration of these shortcuts. A simple toy experiment (e.g. showing how a conventional L2D-Pop model fails under systematic class reindexing while IFD remains stable) would make the argument more concrete and visually intuitive. 2) Wh
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOccupational Health and Safety Research
