TL;DR
This paper introduces an energy-based probabilistic regression method using deep neural networks to model the conditional density p(y|x), providing a clear probabilistic interpretation and outperforming existing methods on various computer vision tasks.
Contribution
The paper proposes a novel energy-based model for deep probabilistic regression with a natural probabilistic interpretation, trained via negative log-likelihood and Monte Carlo sampling.
Findings
Outperforms direct regression and confidence-based methods.
Achieves 2.2% AP improvement on COCO object detection.
Sets new state-of-the-art in visual tracking.
Abstract
While deep learning-based classification is generally tackled using standardized approaches, a wide variety of techniques are employed for regression. In computer vision, one particularly popular such technique is that of confidence-based regression, which entails predicting a confidence value for each input-target pair (x,y). While this approach has demonstrated impressive results, it requires important task-dependent design choices, and the predicted confidences lack a natural probabilistic meaning. We address these issues by proposing a general and conceptually simple regression method with a clear probabilistic interpretation. In our proposed approach, we create an energy-based model of the conditional target density p(y|x), using a deep neural network to predict the un-normalized density from (x,y). This model of p(y|x) is trained by directly minimizing the associated negative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
