Deep Contextual Recurrent Residual Networks for Scene Labeling
T. Hoang Ngan Le, Chi Nhan Duong, Ligong Han, Khoa Luu, Marios, Savvides, Dipan Pal

TL;DR
This paper introduces a novel deep network architecture called Contextual Recurrent Residual Networks (CRRN) that effectively captures long-range contextual dependencies for scene labeling, trained end-to-end from scratch.
Contribution
The paper proposes CRRN, a fully end-to-end trainable deep network that combines residual learning with recurrent context modeling, without relying on pre-trained models.
Findings
CRRN achieves state-of-the-art results on four challenging scene labeling datasets.
CRRN effectively models long-range dependencies in scene images.
The method outperforms existing approaches that use pre-trained models.
Abstract
Designed as extremely deep architectures, deep residual networks which provide a rich visual representation and offer robust convergence behaviors have recently achieved exceptional performance in numerous computer vision problems. Being directly applied to a scene labeling problem, however, they were limited to capture long-range contextual dependence, which is a critical aspect. To address this issue, we propose a novel approach, Contextual Recurrent Residual Networks (CRRN) which is able to simultaneously handle rich visual representation learning and long-range context modeling within a fully end-to-end deep network. Furthermore, our proposed end-to-end CRRN is completely trained from scratch, without using any pre-trained models in contrast to most existing methods usually fine-tuned from the state-of-the-art pre-trained models, e.g. VGG-16, ResNet, etc. The experiments are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Robotics and Sensor-Based Localization
MethodsAverage Pooling · Global Average Pooling · 1x1 Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Bottleneck Residual Block · Max Pooling · Kaiming Initialization · Residual Connection · Convolution
