Learning to Navigate the Energy Landscape
Julien Valentin, Angela Dai, Matthias Nie{\ss}ner, Pushmeet, Kohli, Philip Torr, Shahram Izadi, Cem Keskin

TL;DR
This paper introduces a new architecture for computer vision that improves analysis by synthesis methods by providing multiple initial solutions and a navigational structure for efficient, gradient-free local search, achieving state-of-the-art results.
Contribution
It proposes a novel approach that enhances analysis by synthesis with multiple initial solutions and a navigational structure, surpassing traditional hybrid methods.
Findings
Achieved state-of-the-art results in RGB camera relocalization.
Demonstrated generalizability on hand pose estimation and image retrieval.
Introduced a new large-scale dataset for challenging relocalization tasks.
Abstract
In this paper, we present a novel and efficient architecture for addressing computer vision problems that use `Analysis by Synthesis'. Analysis by synthesis involves the minimization of the reconstruction error which is typically a non-convex function of the latent target variables. State-of-the-art methods adopt a hybrid scheme where discriminatively trained predictors like Random Forests or Convolutional Neural Networks are used to initialize local search algorithms. While these methods have been shown to produce promising results, they often get stuck in local optima. Our method goes beyond the conventional hybrid architecture by not only proposing multiple accurate initial solutions but by also defining a navigational structure over the solution space that can be used for extremely efficient gradient-free local search. We demonstrate the efficacy of our approach on the challenging…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
