Understanding Representation of Deep Equilibrium Models from Neural Collapse Perspective
Haixiang Sun, Ye Shi

TL;DR
This paper analyzes the representation of Deep Equilibrium Models (DEQ) using Neural Collapse phenomena, revealing their geometric properties and advantages over explicit neural networks, especially in imbalanced data scenarios.
Contribution
It provides the first theoretical analysis of DEQ representations via Neural Collapse, demonstrating their geometric structure and robustness in imbalanced conditions.
Findings
Neural Collapse exists in DEQ under balanced conditions.
In imbalanced settings, DEQ shows advantages like feature convergence to simplex vertices.
DEQ demonstrates self-duality and superior handling of imbalanced datasets.
Abstract
Deep Equilibrium Model (DEQ), which serves as a typical implicit neural network, emphasizes their memory efficiency and competitive performance compared to explicit neural networks. However, there has been relatively limited theoretical analysis on the representation of DEQ. In this paper, we utilize the Neural Collapse () as a tool to systematically analyze the representation of DEQ under both balanced and imbalanced conditions. is an interesting phenomenon in the neural network training process that characterizes the geometry of class features and classifier weights. While extensively studied in traditional explicit neural networks, the phenomenon has not received substantial attention in the context of implicit neural networks. We theoretically show that exists in DEQ under balanced conditions. Moreover, in imbalanced…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Thermodynamics and Statistical Mechanics
MethodsSoftmax · Attention Is All You Need · Deep Equilibrium Models
