SaccadeCam: Adaptive Visual Attention for Monocular Depth Sensing

Brevin Tilmon; Sanjeev J. Koppal

arXiv:2103.12981·cs.CV·August 18, 2021

SaccadeCam: Adaptive Visual Attention for Monocular Depth Sensing

Brevin Tilmon, Sanjeev J. Koppal

PDF

Open Access 1 Repo

TL;DR

SaccadeCam introduces an adaptive, self-supervised visual attention mechanism inspired by animal saccades to improve monocular depth sensing, demonstrating promising results in both simulation and hardware prototype.

Contribution

The paper presents a novel self-supervised network for adaptive resolution in monocular depth estimation, inspired by animal eye saccades, with initial hardware prototype results.

Findings

01

Effective adaptive resolution distribution improves depth estimation.

02

End-to-end learning enhances monocular depth sensing.

03

Preliminary hardware results validate the approach.

Abstract

Most monocular depth sensing methods use conventionally captured images that are created without considering scene content. In contrast, animal eyes have fast mechanical motions, called saccades, that control how the scene is imaged by the fovea, where resolution is highest. In this paper, we present the SaccadeCam framework for adaptively distributing resolution onto regions of interest in the scene. Our algorithm for adaptive resolution is a self-supervised network and we demonstrate results for end-to-end learning for monocular depth estimation. We also show preliminary results with a real SaccadeCam hardware prototype.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

btilmon/saccadeCam
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Robotics and Sensor-Based Localization