Deformable Kernels: Adapting Effective Receptive Fields for Object Deformation
Hang Gao, Xizhou Zhu, Steve Lin, Jifeng Dai

TL;DR
This paper introduces Deformable Kernels, a novel convolutional operator that adapts the effective receptive field directly during runtime to better handle object deformations, improving recognition performance.
Contribution
The work proposes a new method for directly adapting the effective receptive field in convolutional networks, providing a generic, drop-in replacement for rigid kernels that enhances deformation handling.
Findings
Deformable Kernels outperform prior deformation modeling methods.
The approach effectively adapts to object deformations across multiple tasks.
Theoretical analysis confirms ERF is determined by data sampling and kernel values.
Abstract
Convolutional networks are not aware of an object's geometric variations, which leads to inefficient utilization of model and data capacity. To overcome this issue, recent works on deformation modeling seek to spatially reconfigure the data towards a common arrangement such that semantic recognition suffers less from deformation. This is typically done by augmenting static operators with learned free-form sampling grids in the image space, dynamically tuned to the data and task for adapting the receptive field. Yet adapting the receptive field does not quite reach the actual goal -- what really matters to the network is the "effective" receptive field (ERF), which reflects how much each pixel contributes. It is thus natural to design other approaches to adapt the ERF directly during runtime. In this work, we instantiate one possible solution as Deformable Kernels (DKs), a family of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Human Pose and Action Recognition · 3D Shape Modeling and Analysis
MethodsCosine Annealing · Random Horizontal Flip · Random Resized Crop · Linear Warmup With Cosine Annealing · SGD with Momentum · Feature Pyramid Network · Region Proposal Network · Softmax · RoIPool · Faster R-CNN
