Hypothesize and Bound: A Computational Focus of Attention Mechanism for Simultaneous N-D Segmentation, Pose Estimation and Classification Using Shape Priors
Diego Rother, Simon Sch\"utz, Ren\'e Vidal

TL;DR
This paper introduces a computational focus of attention framework that efficiently allocates resources to solve complex perceptual problems like segmentation, pose estimation, and classification using shape priors, ensuring global optimality.
Contribution
It presents a novel mathematical framework combining bounding mechanisms and attention strategies to efficiently process large visual data for multiple simultaneous tasks.
Findings
Framework efficiently discards hypotheses with minimal computation
Guarantees to find the globally optimal hypothesis
Runs time depends on the problem, not input bandwidth
Abstract
Given the ever increasing bandwidth of the visual information available to many intelligent systems, it is becoming essential to endow them with a sense of what is worthwhile their attention and what can be safely disregarded. This article presents a general mathematical framework to efficiently allocate the available computational resources to process the parts of the input that are relevant to solve a given perceptual problem. By this we mean to find the hypothesis H (i.e., the state of the world) that maximizes a function L(H), representing how well each hypothesis "explains" the input. Given the large bandwidth of the sensory input, fully evaluating L(H) for each hypothesis H is computationally infeasible (e.g., because it would imply checking a large number of pixels). To address this problem we propose a mathematical framework with two key ingredients. The first one is a Bounding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Vision and Imaging · Medical Image Segmentation Techniques
