Sequential Optimization for Efficient High-Quality Object Proposal Generation
Ziming Zhang, Yun Liu, Xi Chen, Yanjun Zhu, Ming-Ming Cheng, Venkatesh, Saligrama, and Philip H.S. Torr

TL;DR
BING++ is a fast and efficient object proposal method that improves localization accuracy over BING by incorporating edges and segments, achieving high-quality proposals with reduced computational cost.
Contribution
The paper introduces BING++, a novel object proposal algorithm that enhances localization accuracy while maintaining high efficiency, using a probabilistic framework and sequential updates.
Findings
BING++ improves localization quality by 18.5% on VOC2007.
BING++ runs at half the speed of BING on CPU.
BING++ achieves comparable performance to state-of-the-art methods with faster runtime.
Abstract
We are motivated by the need for a generic object proposal generation algorithm which achieves good balance between object detection recall, proposal localization quality and computational efficiency. We propose a novel object proposal algorithm, BING++, which inherits the virtue of good computational efficiency of BING but significantly improves its proposal localization quality. At high level we formulate the problem of object proposal generation from a novel probabilistic perspective, based on which our BING++ manages to improve the localization quality by employing edges and segments to estimate object boundaries and update the proposals sequentially. We propose learning the parameters efficiently by searching for approximate solutions in a quantized parameter space for complexity reduction. We demonstrate the generalization of BING++ with the same fixed parameters across different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
