Unimodal Distributions for Ordinal Regression
Jaime S. Cardoso, Ricardo Cruz, Tom\'e Albuquerque

TL;DR
This paper introduces two novel methods for enforcing unimodal distributions in ordinal regression models, backed by theoretical analysis and experiments demonstrating competitive performance and high unimodality.
Contribution
It provides a theoretical foundation for unimodal distributions and proposes a new architecture and loss function to incorporate unimodality into ordinal regression.
Findings
The new architecture achieves top-2 performance.
The proposed loss maintains high unimodality.
Theoretical analysis of unimodal distributions in the probability simplex.
Abstract
In many real-world prediction tasks, class labels contain information about the relative order between labels that are not captured by commonly used loss functions such as multicategory cross-entropy. Recently, the preference for unimodal distributions in the output space has been incorporated into models and loss functions to account for such ordering information. However, current approaches rely on heuristics that lack a theoretical foundation. Here, we propose two new approaches to incorporate the preference for unimodal distributions into the predictive model. We analyse the set of unimodal distributions in the probability simplex and establish fundamental properties. We then propose a new architecture that imposes unimodal distributions and a new loss term that relies on the notion of projection in a set to promote unimodality. Experiments show the new architecture achieves top-2…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Music and Audio Processing · Statistical and Computational Modeling
