Learning Object Location Predictors with Boosting and Grammar-Guided Feature Extraction
Damian Eads (1), Edward Rosten (2), David Helmbold (1) ((1) University, of California Santa Cruz, (2) University of Cambridge)

TL;DR
BEAMER introduces a novel spatially aware boosting approach with grammar-guided feature extraction for improved object detection in noisy aerial imagery, demonstrating strong performance across multiple detection tasks.
Contribution
It combines grammar-guided feature extraction with a spatially aware AdaBoost variant and optimized location prediction, advancing object detection in challenging noisy data.
Findings
Effective in detecting objects in noisy aerial images
Performs well on object counting, tracking, and detection tasks
Outperforms existing methods on a challenging dataset
Abstract
We present BEAMER: a new spatially exploitative approach to learning object detectors which shows excellent results when applied to the task of detecting objects in greyscale aerial imagery in the presence of ambiguous and noisy data. There are four main contributions used to produce these results. First, we introduce a grammar-guided feature extraction system, enabling the exploration of a richer feature space while constraining the features to a useful subset. This is specified with a rule-based generative grammar crafted by a human expert. Second, we learn a classifier on this data using a newly proposed variant of AdaBoost which takes into account the spatially correlated nature of the data. Third, we perform another round of training to optimize the method of converting the pixel classifications generated by boosting into a high quality set of (x, y) locations. Lastly, we carefully…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Advanced Neural Network Applications
