Investigating the Histogram Loss in Regression
Ehsan Imani, Kai Luedemann, Sam Scholnick-Hughes, Esraa Elelimy,, Martha White

TL;DR
This paper analyzes the Histogram Loss for regression, revealing that its performance gains stem mainly from optimization improvements rather than modeling additional information, and demonstrates its practical viability.
Contribution
It provides both theoretical and empirical insights into why Histogram Loss improves regression performance and shows its effectiveness without extensive hyperparameter tuning.
Findings
Performance gains are mainly due to optimization improvements.
Histogram Loss is effective in practical deep learning tasks.
No extensive hyperparameter tuning needed for viability.
Abstract
It is becoming increasingly common in regression to train neural networks that model the entire distribution even if only the mean is required for prediction. This additional modeling often comes with performance gain and the reasons behind the improvement are not fully known. This paper investigates a recent approach to regression, the Histogram Loss, which involves learning the conditional distribution of the target variable by minimizing the cross-entropy between a target distribution and a flexible histogram prediction. We design theoretical and empirical analyses to determine why and when this performance gain appears, and how different components of the loss contribute to it. Our results suggest that the benefits of learning distributions in this setup come from improvements in optimization rather than modelling extra information. We then demonstrate the viability of the Histogram…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification
