Segmentation-Aware Convolutional Networks Using Local Attention Masks

Adam W. Harley; Konstantinos G. Derpanis; Iasonas Kokkinos

arXiv:1708.04607 · cs.CV · August 16, 2017

TL;DR

This paper presents a segmentation-aware CNN that integrates local segmentation cues via attention masks, improving spatial precision and performance in dense prediction tasks like semantic segmentation and optical flow.

Contribution

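The authors introduce a method for incorporating segmentation information into CNNs. Embeddings learned by the network yield a local attention mask at every neuron position, and standard convolution is replaced with a segmentation-aware variant that lets each neuron attend selectively to inputs from its own region, which increases spatial precision.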

Findings

01 Matches the performance of DenseCRFs in semantic segmentation.

02 Produces sharper optical flow responses than baseline networks.

03 Yields systematic improvements over strong baseline models in both tasks.

Abstract

We introduce an approach to integrate segmentation information within a convolutional neural network (CNN). This counter-acts the tendency of CNNs to smooth information across regions and increases their spatial precision. To obtain segmentation information, we set up a CNN to provide an embedding space where region co-membership can be estimated based on Euclidean distance. We use these embeddings to compute a local attention mask relative to every neuron position. We incorporate such masks in CNNs and replace the convolution operation with a "segmentation-aware" variant that allows a neuron to selectively attend to inputs coming from its own region. We call the resulting network a segmentation-aware CNN because it adapts its filters at each image point according to local segmentation cues. We demonstrate the merit of our method on two widely different dense prediction tasks, that…
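The mechanism described in the abstract can be illustrated with a small sketch. The code below is not the authors' implementation: the exponential mask form exp(-lam * distance), the normalization by the sum of mask values, and the names local_attention_mask, segaware_conv, radius, and lam are all assumptions chosen for illustration. It operates on a single-channel image stored as nested lists, with one embedding vector per pixel.

```python
import math

def local_attention_mask(emb, i, j, radius=1, lam=1.0):
    """Return {(di, dj): weight} for the in-bounds neighbors of pixel (i, j).

    Assumed mask form: exp(-lam * Euclidean distance between embeddings),
    so neighbors with similar embeddings (same region) get weight near 1.
    """
    H, W = len(emb), len(emb[0])
    mask = {}
    for di in range(-radius, radius + 1):
        for dj in range(-radius, radius + 1):
            ni, nj = i + di, j + dj
            if 0 <= ni < H and 0 <= nj < W:
                dist = math.dist(emb[i][j], emb[ni][nj])
                mask[(di, dj)] = math.exp(-lam * dist)
    return mask

def segaware_conv(x, emb, weights, radius=1, lam=1.0):
    """Mask-weighted convolution of a single-channel image x.

    Each filter tap is scaled by the local attention mask, and the result
    is divided by the mask sum (an assumed normalization). The center tap
    always has mask value 1, so the denominator is never zero.
    """
    H, W = len(x), len(x[0])
    out = [[0.0] * W for _ in range(H)]
    for i in range(H):
        for j in range(W):
            mask = local_attention_mask(emb, i, j, radius, lam)
            num = sum(weights[di + radius][dj + radius] * m * x[i + di][j + dj]
                      for (di, dj), m in mask.items())
            out[i][j] = num / sum(mask.values())
    return out

# Toy input: a step edge whose two sides have well-separated embeddings.
x = [[0.0] * 3 + [1.0] * 3 for _ in range(4)]
emb = [[(10.0 * v,) for v in row] for row in x]
box = [[1.0] * 3 for _ in range(3)]  # plain 3x3 averaging filter
out = segaware_conv(x, emb, box)
```

With an ordinary averaging filter the step edge in x would be blurred across the boundary. Here the masks suppress contributions from the other region, so out stays very close to x on both sides of the edge, which is the smoothing-across-regions effect the abstract says the method counteracts.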

Code & Models

Repositories

aharley/segaware (Official)

Taxonomy

Methods: Convolution