TL;DR
This paper introduces Gated Recurrent Convolutional Neural Networks (GRCNN), which adaptively control receptive fields using gates, improving performance on various vision tasks over previous RCNN models.
Contribution
The paper proposes gated recurrent convolution layers (GRCL) that modulate receptive fields, creating a deep GRCNN model with improved adaptability and performance.
Findings
GRCNN outperforms RCNN on object recognition, scene text recognition, and object detection.
GRCNN achieves competitive results with state-of-the-art models on benchmark datasets.
The code for GRCNN is publicly available for further research.
Abstract
The convolutional neural network (CNN) has become a basic model for solving many computer vision problems. In recent years, a new class of CNNs, recurrent convolution neural network (RCNN), inspired by abundant recurrent connections in the visual systems of animals, was proposed. The critical element of RCNN is the recurrent convolutional layer (RCL), which incorporates recurrent connections between neurons in the standard convolutional layer. With increasing number of recurrent computations, the receptive fields (RFs) of neurons in RCL expand unboundedly, which is inconsistent with biological facts. We propose to modulate the RFs of neurons by introducing gates to the recurrent connections. The gates control the amount of context information inputting to the neurons and the neurons' RFs therefore become adaptive. The resulting layer is called gated recurrent convolution layer (GRCL).…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsConvolution
