Max-Margin Object Detection
Davis E. King

TL;DR
This paper introduces Max-Margin Object Detection (MMOD), a novel learning method that optimizes over all image sub-windows without sampling, leading to significant performance improvements in object detection tasks.
Contribution
The paper presents MMOD, a new approach that directly optimizes object detection over all sub-windows, enhancing detection accuracy over traditional sampling methods.
Findings
MMOD improves detection performance on three datasets.
A single HOG filter with MMOD outperforms deformable models.
Substantial gains achieved without sub-sampling of windows.
Abstract
Most object detection methods operate by applying a binary classifier to sub-windows of an image, followed by a non-maximum suppression step where detections on overlapping sub-windows are removed. Since the number of possible sub-windows in even moderately sized image datasets is extremely large, the classifier is typically learned from only a subset of the windows. This avoids the computational difficulty of dealing with the entire set of sub-windows, however, as we will show in this paper, it leads to sub-optimal detector performance. In particular, the main contribution of this paper is the introduction of a new method, Max-Margin Object Detection (MMOD), for learning to detect objects in images. This method does not perform any sub-sampling, but instead optimizes over all sub-windows. MMOD can be used to improve any object detection method which is linear in the learned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
