Processing Megapixel Images with Deep Attention-Sampling Models

Angelos Katharopoulos; Fran\c{c}ois Fleuret

arXiv:1905.03711·cs.CV·July 18, 2019·22 cites

Processing Megapixel Images with Deep Attention-Sampling Models

Angelos Katharopoulos, Fran\c{c}ois Fleuret

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper introduces an attention sampling model that enables processing of megapixel images efficiently by focusing computation on informative regions, reducing resource use while maintaining accuracy.

Contribution

It presents a novel differentiable attention sampling method that processes large images efficiently and can be trained end-to-end with standard SGD.

Findings

01

Reduces computation and memory by an order of magnitude.

02

Maintains accuracy comparable to classical models.

03

Focuses on informative image regions during sampling.

Abstract

Existing deep architectures cannot operate on very large signals such as megapixel images due to computational and memory constraints. To tackle this limitation, we propose a fully differentiable end-to-end trainable model that samples and processes only a fraction of the full resolution input image. The locations to process are sampled from an attention distribution computed from a low resolution view of the input. We refer to our method as attention sampling and it can process images of several megapixels with a standard single GPU setup. We show that sampling from the attention distribution results in an unbiased estimator of the full model with minimal variance, and we derive an unbiased estimator of the gradient that we use to train our model end-to-end with a normal SGD procedure. This new method is evaluated on three classification tasks, where we show that it allows to reduce…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Processing Megapixel Images with Deep Attention-Sampling Models· youtube

Taxonomy

TopicsMedical Imaging Techniques and Applications · CCD and CMOS Imaging Sensors · Medical Image Segmentation Techniques

MethodsStochastic Gradient Descent