Context Forest for efficient object detection with large mixture models
Davide Modolo, Alexander Vezhnevets, Vittorio Ferrari

TL;DR
The paper introduces Context Forest (ConF), a fast and memory-efficient method for predicting object properties from global image appearance, enabling faster and more accurate object detection, especially with large datasets.
Contribution
ConF is a novel technique that predicts object attributes to optimize component selection in detectors, significantly improving speed and accuracy over standard methods.
Findings
2x speed-up for DPM detector
10x speed-up for EE-SVM detector
2% mAP improvement by predicting object locations
Abstract
We present Context Forest (ConF), a technique for predicting properties of the objects in an image based on its global appearance. Compared to standard nearest-neighbour techniques, ConF is more accurate, fast and memory efficient. We train ConF to predict which aspects of an object class are likely to appear in a given image (e.g. which viewpoint). This enables to speed-up multi-component object detectors, by automatically selecting the most relevant components to run on that image. This is particularly useful for detectors trained from large datasets, which typically need many components to fully absorb the data and reach their peak performance. ConF provides a speed-up of 2x for the DPM detector [1] and of 10x for the EE-SVM detector [2]. To show ConF's generality, we also train it to predict at which locations objects are likely to appear in an image. Incorporating this information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods
