Neural Collapse in Cumulative Link Models for Ordinal Regression: An Analysis with Unconstrained Feature Model
Chuang Ma, Tomoyuki Obuchi, Toshiyuki Tanaka

TL;DR
This paper investigates the emergence of Neural Collapse phenomena in deep Ordinal Regression tasks using the Unconstrained Feature Model, revealing new geometric properties and their implications for model design.
Contribution
It extends the Neural Collapse analysis to ordinal regression, introducing the concept of Ordinal Neural Collapse and providing theoretical and empirical validation within the UFM framework.
Findings
Ordinal Neural Collapse (ONC) properties are analytically proven.
Optimal features collapse to class means with regularization.
Latent variables align according to class order, especially with zero regularization.
Abstract
A phenomenon known as ''Neural Collapse (NC)'' in deep classification tasks, in which the penultimate-layer features and the final classifiers exhibit an extremely simple geometric structure, has recently attracted considerable attention, with the expectation that it can deepen our understanding of how deep neural networks behave. The Unconstrained Feature Model (UFM) has been proposed to explain NC theoretically, and there emerges a growing body of work that extends NC to tasks other than classification and leverages it for practical applications. In this study, we investigate whether a similar phenomenon arises in deep Ordinal Regression (OR) tasks, via combining the cumulative link model for OR and UFM. We show that a phenomenon we call Ordinal Neural Collapse (ONC) indeed emerges and is characterized by the following three properties: (ONC1) all optimal features in the same class…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Generative Adversarial Networks and Image Synthesis · Stochastic Gradient Optimization Techniques
