DyCAF-Net: Dynamic Class-Aware Fusion Network
Md Abrar Jahin, Shahriar Soudeep, M. F. Mridha, Nafiz Fahad, Md. Jakir Hossen

TL;DR
DyCAF-Net introduces a dynamic, class-aware fusion approach with input-conditioned feature refinement and dual attention mechanisms, significantly improving object detection performance in complex, real-world scenarios while maintaining efficiency.
Contribution
It proposes a novel dynamic fusion framework with class-aware and input-dependent mechanisms, advancing object detection in challenging environments.
Findings
Improves precision and mAP across diverse benchmarks.
Maintains computational efficiency with ~11.1M parameters.
Outperforms nine state-of-the-art baselines.
Abstract
Recent advancements in object detection rely on modular architectures with multi-scale fusion and attention mechanisms. However, static fusion heuristics and class-agnostic attention limit performance in dynamic scenes with occlusions, clutter, and class imbalance. We introduce Dynamic Class-Aware Fusion Network (DyCAF-Net) that addresses these challenges through three innovations: (1) an input-conditioned equilibrium-based neck that iteratively refines multi-scale features via implicit fixed-point modeling, (2) a dual dynamic attention mechanism that adaptively recalibrates channel and spatial responses using input- and class-dependent cues, and (3) class-aware feature adaptation that modulates features to prioritize discriminative regions for rare classes. Through comprehensive ablation studies with YOLOv8 and related architectures, alongside benchmarking against nine state-of-the-art…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
