Training Deep Neural Networks via Direct Loss Minimization
Yang Song, Alexander G. Schwing, Richard S. Zemel, Raquel Urtasun

TL;DR
This paper introduces a direct loss minimization method for training deep neural networks that optimizes application-specific metrics like average precision, overcoming limitations of standard cross-entropy training.
Contribution
It presents a novel approach to train neural networks by directly minimizing complex, non-smooth loss functions, with a dynamic programming algorithm for efficient updates.
Findings
Outperforms standard methods on ranking tasks.
Effective in noisy label scenarios.
Improves action classification and object detection results.
Abstract
Supervised training of deep neural nets typically relies on minimizing cross-entropy. However, in many domains, we are interested in performing well on metrics specific to the application. In this paper we propose a direct loss minimization approach to train deep neural networks, which provably minimizes the application-specific loss function. This is often non-trivial, since these functions are neither smooth nor decomposable and thus are not amenable to optimization with standard gradient-based methods. We demonstrate the effectiveness of our approach in the context of maximizing average precision for ranking problems. Towards this goal, we develop a novel dynamic programming algorithm that can efficiently compute the weight updates. Our approach proves superior to a variety of baselines in the context of action classification and object detection, especially in the presence of label…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
