Deep Structured Features for Semantic Segmentation

Michael Tschannen; Lukas Cavigelli; Fabian Mentzer; Thomas Wiatowski,; Luca Benini

arXiv:1609.07916·cs.CV·June 19, 2017

Deep Structured Features for Semantic Segmentation

Michael Tschannen, Lukas Cavigelli, Fabian Mentzer, Thomas Wiatowski,, Luca Benini

PDF

TL;DR

This paper introduces a compact neural network architecture for semantic segmentation that combines wavelet-based CNNs, kernel approximation, and a linear classifier, achieving competitive accuracy with less data and smaller models.

Contribution

The paper presents a novel, highly structured neural network architecture that is highly efficient and suitable for low-power devices, with a unique combination of pre-specified layers and a learned classifier.

Findings

01

Achieves competitive accuracy with traditional CNNs on outdoor and aerial images.

02

Requires significantly less training data to reach similar performance.

03

Has a very small model size suitable for embedded platforms.

Abstract

We propose a highly structured neural network architecture for semantic segmentation with an extremely small model size, suitable for low-power embedded and mobile platforms. Specifically, our architecture combines i) a Haar wavelet-based tree-like convolutional neural network (CNN), ii) a random layer realizing a radial basis function kernel approximation, and iii) a linear classifier. While stages i) and ii) are completely pre-specified, only the linear classifier is learned from data. We apply the proposed architecture to outdoor scene and aerial image semantic segmentation and show that the accuracy of our architecture is competitive with conventional pixel classification CNNs. Furthermore, we demonstrate that the proposed architecture is data efficient in the sense of matching the accuracy of pixel classification CNNs when trained on a much smaller data set.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.