Learnable Instance Attention Filtering for Adaptive Detector Distillation

Chen Liu; Qizhen Lan; Zhicheng Ding; Xinyu Chu; Qing Tian

arXiv:2603.26088·cs.CV·March 30, 2026

Learnable Instance Attention Filtering for Adaptive Detector Distillation

Chen Liu, Qizhen Lan, Zhicheng Ding, Xinyu Chu, Qing Tian

PDF

TL;DR

This paper introduces LIAF-KD, a learnable instance attention filtering framework that adaptively reweights object instances during detector knowledge distillation, leading to improved performance on KITTI and COCO datasets.

Contribution

It presents a novel learnable instance selector mechanism that dynamically evaluates instance importance, enhancing detector distillation beyond heuristic or teacher-driven methods.

Findings

01

Achieves a 2% performance gain on GFL ResNet-50 without extra complexity.

02

Demonstrates consistent improvements on KITTI and COCO datasets.

03

Outperforms existing state-of-the-art distillation methods.

Abstract

As deep vision models grow increasingly complex to achieve higher performance, deployment efficiency has become a critical concern. Knowledge distillation (KD) mitigates this issue by transferring knowledge from large teacher models to compact student models. While many feature-based KD methods rely on spatial filtering to guide distillation, they typically treat all object instances uniformly, ignoring instance-level variability. Moreover, existing attention filtering mechanisms are typically heuristic or teacher-driven, rather than learned with the student. To address these limitations, we propose Learnable Instance Attention Filtering for Adaptive Detector Distillation (LIAF-KD), a novel framework that introduces learnable instance selectors to dynamically evaluate and reweight instance importance during distillation. Notably, the student contributes to this process based on its…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.