Raising the Ceiling: Better Empirical Fixation Densities for Saliency Benchmarking
Susmit Agrawal, Jannis Hollman, Matthias K\"ummerer

TL;DR
This paper introduces a new mixture model for estimating empirical fixation densities in saliency benchmarking, significantly improving reliability and enabling better failure case analysis.
Contribution
It proposes an adaptive mixture model combining KDE, bias components, and saliency models, optimized per image, to enhance fixation density estimates.
Findings
Higher interobserver consistency estimates across benchmarks.
Median gains of 5-15% in log-likelihood and up to 2% in AUC.
Over 25% improvement on images critical for failure analysis.
Abstract
Empirical fixation densities, spatial distributions estimated from human eye-tracking data, are foundational to saliency benchmarking. They directly shape benchmark conclusions, leaderboard rankings, failure case analyses, and scientific claims about human visual behavior. Yet the standard estimation method, fixed-bandwidth isotropic Gaussian KDE, has gone essentially unchanged for decades. This matters now more than ever: as the field shifts toward sample-level evaluation (failure case analysis, inverse benchmarking, per-image model comparison), reliable per-image density estimates become critical. We propose a principled mixture model that combines an adaptive-bandwidth KDE based on Abramson's method, center bias and uniform components, and a state-of-the-art saliency model, to capture different spatial and semantic types of interobserver consistency, and optimize all parameters per…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
