PKINet-v2: Towards Powerful and Efficient Poly-Kernel Remote Sensing Object Detection
Xinhao Cai, Liulei Li, Gensheng Pei, Zeren Sun, Yazhou Yao, Wenguan Wang

TL;DR
PKINet-v2 introduces a unified backbone for remote sensing object detection that combines anisotropic and isotropic kernels, achieving state-of-the-art accuracy and efficiency by effectively modeling diverse object geometries and scales.
Contribution
The paper proposes PKINet-v2, a novel backbone that jointly models slender and broad objects using a multi-scope receptive field and introduces HKR for efficient inference.
Findings
Achieves state-of-the-art accuracy on multiple benchmarks.
Provides 3.9x faster inference compared to PKINet-v1.
Outperforms previous backbones in effectiveness and efficiency.
Abstract
Object detection in remote sensing images (RSIs) is challenged by the coexistence of geometric and spatial complexity: targets may appear with diverse aspect ratios, while spanning a wide range of object sizes under varied contexts. Existing RSI backbones address the two challenges separately, either by adopting anisotropic strip kernels to model slender targets or by using isotropic large kernels to capture broader context. However, such isolated treatments lead to complementary drawbacks: the strip-only design can disrupt spatial coherence for regular-shaped objects and weaken tiny details, whereas isotropic large kernels often introduce severe background noise and geometric mismatch for slender structures. In this paper, we extend PKINet, and present a powerful and efficient backbone that jointly handles both challenges within a unified paradigm named Poly Kernel Inception Network v2…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Remote-Sensing Image Classification · Domain Adaptation and Few-Shot Learning
