Let the Experts Speak: Improving Survival Prediction & Calibration via Mixture-of-Experts Heads

Todd Morrill; Aahlad Puli; Murad Megjhani; Soojin Park; Richard Zemel

arXiv:2511.09567·cs.LG·November 25, 2025

Open Access

TL;DR

This paper introduces discrete-time deep mixture-of-experts models for survival analysis that cluster patients while improving calibration and predictive accuracy, and it identifies expert expressiveness as the key factor behind these gains.

Contribution

The work proposes novel discrete-time deep mixture-of-experts architectures that balance clustering, calibration, and accuracy, highlighting the role of expert expressiveness in performance.

Findings

01 More expressive experts outperform fixed prototypes.

02 One architecture achieves clustering, calibration, and accuracy simultaneously.

03 Expert expressiveness is crucial for model performance.

Abstract

Deep mixture-of-experts models have attracted considerable attention for survival analysis problems, particularly for their ability to cluster similar patients together. In practice, grouping often comes at the expense of key metrics such as calibration error and predictive accuracy. This is due to the restrictive inductive bias that mixture-of-experts imposes: predictions for individual patients must look like predictions for the group they're assigned to. Might we be able to discover patient group structure, where it exists, while improving calibration and predictive accuracy? In this work, we introduce several discrete-time deep mixture-of-experts (MoE)-based architectures for survival analysis problems, one of which achieves all desiderata: clustering, calibration, and predictive accuracy. We show that a key differentiator between this array of MoEs is how expressive their experts…
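To make the inductive bias described in the abstract concrete, here is a minimal sketch of a discrete-time mixture-of-experts head. It is not the authors' architecture. It assumes that "fixed prototypes" means per-expert distributions over time bins that do not depend on the patient, with only the gate weights varying per patient. All names (`mixture_pmf`, `survival_curve`, the example logits) are hypothetical and chosen for illustration.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def mixture_pmf(gate_logits, expert_logits):
    """Mix K per-expert PMFs over T time bins using softmax gate weights.

    Assumption: each row of expert_logits is a fixed prototype shared by
    all patients; only gate_logits would vary per patient.
    """
    weights = softmax(gate_logits)                          # K weights, sum to 1
    expert_pmfs = [softmax(row) for row in expert_logits]   # K rows of T probabilities
    n_bins = len(expert_pmfs[0])
    return [
        sum(w * pmf[t] for w, pmf in zip(weights, expert_pmfs))
        for t in range(n_bins)
    ]

def survival_curve(pmf):
    """S(t) = 1 - cumulative probability of an event up to and including bin t."""
    curve, cumulative = [], 0.0
    for p in pmf:
        cumulative += p
        curve.append(1.0 - cumulative)
    return curve

# Hypothetical example: two experts ("early" and "late" event groups), four time bins.
expert_logits = [[2.0, 1.0, 0.0, -1.0], [-1.0, 0.0, 1.0, 2.0]]
gate_logits = [3.0, 0.0]  # this patient is routed mostly to expert 0
pmf = mixture_pmf(gate_logits, expert_logits)
surv = survival_curve(pmf)
```

Because the gate puts most of its weight on expert 0, this patient's predicted distribution is nearly that expert's prototype, which is the restriction the abstract describes. Under this reading, a more expressive expert would compute its logits from the patient's features instead of using a fixed row.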

Taxonomy

Topics: Machine Learning in Healthcare · Generative Adversarial Networks and Image Synthesis · Artificial Intelligence in Healthcare and Education