Dilated Residual Networks
Fisher Yu, Vladlen Koltun, Thomas Funkhouser

TL;DR
Dilated Residual Networks (DRNs) improve image classification and scene understanding by maintaining high-resolution feature maps through dilation, reducing artifacts, and enhancing performance in downstream tasks like localization and segmentation.
Contribution
This paper introduces dilated residual networks that outperform traditional networks without increasing complexity and proposes a degridding method to further boost accuracy.
Findings
DRNs outperform non-dilated networks in classification accuracy.
Degridding reduces artifacts and improves model performance.
DRNs significantly enhance downstream tasks like localization and segmentation.
Abstract
Convolutional networks for image classification progressively reduce resolution until the image is represented by tiny feature maps in which the spatial structure of the scene is no longer discernible. Such loss of spatial acuity can limit image classification accuracy and complicate the transfer of the model to downstream applications that require detailed scene understanding. These problems can be alleviated by dilation, which increases the resolution of output feature maps without reducing the receptive field of individual neurons. We show that dilated residual networks (DRNs) outperform their non-dilated counterparts in image classification without increasing the model's depth or complexity. We then study gridding artifacts introduced by dilation, develop an approach to removing these artifacts (`degridding'), and show that this further increases the performance of DRNs. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCell Image Analysis Techniques · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
