Deformable ConvNets v2: More Deformable, Better Results
Xizhou Zhu, Han Hu, Stephen Lin, Jifeng Dai

TL;DR
Deformable ConvNets v2 enhances the original model by improving focus on relevant image regions through increased modeling power and training strategies, leading to significant performance improvements in object detection and segmentation.
Contribution
This paper introduces a reformulated Deformable ConvNets with expanded deformation modeling and a feature mimicking training scheme for better object focus.
Findings
Significant performance gains over the original Deformable ConvNets
Leading results on COCO object detection and segmentation benchmarks
Enhanced ability to focus on pertinent image regions
Abstract
The superior performance of Deformable Convolutional Networks arises from its ability to adapt to the geometric variations of objects. Through an examination of its adaptive behavior, we observe that while the spatial support for its neural features conforms more closely than regular ConvNets to object structure, this support may nevertheless extend well beyond the region of interest, causing features to be influenced by irrelevant image content. To address this problem, we present a reformulation of Deformable ConvNets that improves its ability to focus on pertinent image regions, through increased modeling power and stronger training. The modeling power is enhanced through a more comprehensive integration of deformable convolution within the network, and by introducing a modulation mechanism that expands the scope of deformation modeling. To effectively harness this enriched modeling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
MethodsDeformable Convolutional Networks · Average Pooling · Weight Decay · SGD with Momentum · Non Maximum Suppression · Deformable RoI Pooling · RoIAlign · Mask R-CNN · Region Proposal Network · Softmax
