Feature Fusion and Knowledge-Distilled Multi-Modal Multi-Target Detection
Ngoc Tuyen Do, Tri Nhu Do

TL;DR
This paper introduces a multi-modal multi-target detection framework using feature fusion and knowledge distillation, significantly improving accuracy and efficiency for resource-constrained embedded systems in surveillance applications.
Contribution
It presents a novel fusion-based multi-modal model combined with a knowledge distillation pipeline, enhancing domain adaptation and reducing inference time.
Findings
Student model achieves 95% of teacher model's accuracy
Inference time is reduced by approximately 50%
Effective knowledge transfer improves detection performance
Abstract
In the surveillance and defense domain, multi-target detection and classification (MTD) is considered essential yet challenging due to heterogeneous inputs from diverse data sources and the computational complexity of algorithms designed for resource-constrained embedded devices, particularly for Al-based solutions. To address these challenges, we propose a feature fusion and knowledge-distilled framework for multi-modal MTD that leverages data fusion to enhance accuracy and employs knowledge distillation for improved domain adaptation. Specifically, our approach utilizes both RGB and thermal image inputs within a novel fusion-based multi-modal model, coupled with a distillation training pipeline. We formulate the problem as a posterior probability optimization task, which is solved through a multi-stage training pipeline supported by a composite loss function. This loss function…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfrared Target Detection Methodologies · Fault Detection and Control Systems · Advanced Measurement and Detection Methods
MethodsKnowledge Distillation
