TL;DR
The paper introduces Orthogonal Projection Loss (OPL), a novel loss function that explicitly enforces orthogonality among class features to improve class separation and robustness in neural network training.
Contribution
It proposes OPL, a new loss that directly enforces inter-class orthogonality without extra parameters, enhancing feature separation and robustness across various tasks.
Findings
OPL improves class separation in neural networks.
OPL enhances robustness against adversarial attacks and label noise.
OPL performs well across diverse datasets and tasks.
Abstract
Deep neural networks have achieved remarkable performance on a range of classification tasks, with softmax cross-entropy (CE) loss emerging as the de-facto objective function. The CE loss encourages features of a class to have a higher projection score on the true class-vector compared to the negative classes. However, this is a relative constraint and does not explicitly force different class features to be well-separated. Motivated by the observation that ground-truth class representations in CE loss are orthogonal (one-hot encoded vectors), we develop a novel loss function termed `Orthogonal Projection Loss' (OPL) which imposes orthogonality in the feature space. OPL augments the properties of CE loss and directly enforces inter-class separation alongside intra-class clustering in the feature space through orthogonality constraints on the mini-batch level. As compared to other…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSoftmax
