Task-Specific Context Decoupling for Object Detection
Jiayuan Zhuang, Zheng Qin, Hao Yu, Xucan Chen

TL;DR
This paper introduces TSCODE, a novel detection head that disentangles features for classification and localization, improving object detection accuracy by over 1.0 AP with minimal additional computation.
Contribution
The paper proposes a new task-specific feature decoupling head that enhances feature representations for classification and localization in object detection.
Findings
Improves detection accuracy by over 1.0 AP across various detectors.
Provides a plug-and-play module easily integrated into existing pipelines.
Achieves better feature encoding for classification and localization tasks.
Abstract
Classification and localization are two main sub-tasks in object detection. Nonetheless, these two tasks have inconsistent preferences for feature context, i.e., localization expects more boundary-aware features to accurately regress the bounding box, while more semantic context is preferred for object classification. Exsiting methods usually leverage disentangled heads to learn different feature context for each task. However, the heads are still applied on the same input features, which leads to an imperfect balance between classifcation and localization. In this work, we propose a novel Task-Specific COntext DEcoupling (TSCODE) head which further disentangles the feature encoding for two tasks. For classification, we generate spatially-coarse but semantically-strong feature encoding. For localization, we provide high-resolution feature map containing more edge information to better…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
