Deep Contextual Recurrent Residual Networks for Scene Labeling

T. Hoang Ngan Le; Chi Nhan Duong; Ligong Han; Khoa Luu; Marios; Savvides; Dipan Pal

arXiv:1704.03594·cs.CV·April 13, 2017·1 cites

Deep Contextual Recurrent Residual Networks for Scene Labeling

T. Hoang Ngan Le, Chi Nhan Duong, Ligong Han, Khoa Luu, Marios, Savvides, Dipan Pal

PDF

Open Access

TL;DR

This paper introduces a novel deep network architecture called Contextual Recurrent Residual Networks (CRRN) that effectively captures long-range contextual dependencies for scene labeling, trained end-to-end from scratch.

Contribution

The paper proposes CRRN, a fully end-to-end trainable deep network that combines residual learning with recurrent context modeling, without relying on pre-trained models.

Findings

01

CRRN achieves state-of-the-art results on four challenging scene labeling datasets.

02

CRRN effectively models long-range dependencies in scene images.

03

The method outperforms existing approaches that use pre-trained models.

Abstract

Designed as extremely deep architectures, deep residual networks which provide a rich visual representation and offer robust convergence behaviors have recently achieved exceptional performance in numerous computer vision problems. Being directly applied to a scene labeling problem, however, they were limited to capture long-range contextual dependence, which is a critical aspect. To address this issue, we propose a novel approach, Contextual Recurrent Residual Networks (CRRN) which is able to simultaneously handle rich visual representation learning and long-range context modeling within a fully end-to-end deep network. Furthermore, our proposed end-to-end CRRN is completely trained from scratch, without using any pre-trained models in contrast to most existing methods usually fine-tuned from the state-of-the-art pre-trained models, e.g. VGG-16, ResNet, etc. The experiments are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Robotics and Sensor-Based Localization

MethodsAverage Pooling · Global Average Pooling · 1x1 Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Bottleneck Residual Block · Max Pooling · Kaiming Initialization · Residual Connection · Convolution