Tippett-minimum Fusion of Representation-space Diffusion Models for Multi-Encoder Out-of-Distribution Detection
Neelkamal Bhuyan

TL;DR
This paper introduces EncMin2L, a multi-encoder fusion method using diffusion models for robust out-of-distribution detection across various distribution shifts, outperforming existing methods.
Contribution
It proposes a novel encoder-agnostic two-level min-gate for combining diffusion-based likelihoods without OOD labels, reducing parameter cost and improving detection accuracy.
Findings
Achieves ≥0.94 AUROC across all shift types
Outperforms state-of-the-art diffusion OOD detectors
Operates with 2.3× fewer parameters
Abstract
We address out-of-distribution (OOD) detection across the full spectrum of distribution shifts -- global domain changes, semantic divergence, texture differences, and covariate corruptions -- through a multi-encoder fusion of per-encoder representation-space diffusion models (RDMs). We statistically identify each encoder's sensitivity to specific shift types from ID data alone and introduce EncMin2L -- an encoder-agnostic two-level -gate that combines and calibrates per-encoder diffusion-based likelihood detectors without OOD labels, outperforming monolithic multi-encoder baselines at lower parameter cost. Two ID-data diagnostics: (class-conditional F-test) and (log-likelihood shift under synthetic corruptions) -- quantify encoder specialization, while a Tippett minimum -value combination aggregates per-encoder scores into a single,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
