Deep Ordinal Regression using Optimal Transport Loss and Unimodal Output Probabilities
Uri Shaham, Igal Zaidman, Jonathan Svirsky

TL;DR
This paper introduces a deep ordinal regression framework that guarantees unimodal output probabilities using optimal transport loss and architectural modifications, outperforming existing methods on multiple datasets.
Contribution
It proposes a novel deep ordinal regression method that ensures unimodal predictions and employs optimal transport loss, improving performance and confidence calibration.
Findings
Consistently outperforms recent deep ordinal regression methods.
Guarantees unimodal output probabilities at inference.
Less overconfident predictions compared to baselines.
Abstract
It is often desired that ordinal regression models yield unimodal predictions. However, in many recent works this characteristic is either absent, or implemented using soft targets, which do not guarantee unimodal outputs at inference. In addition, we argue that the standard maximum likelihood objective is not suitable for ordinal regression problems, and that optimal transport is better suited for this task, as it naturally captures the order of the classes. In this work, we propose a framework for deep ordinal regression, based on unimodal output distribution and optimal transport loss. Inspired by the well-known Proportional Odds model, we propose to modify its design by using an architectural mechanism which guarantees that the model output distribution will be unimodal. We empirically analyze the different components of our proposed approach and demonstrate their contribution to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Human Pose and Action Recognition · Domain Adaptation and Few-Shot Learning
