TL;DR
DEYOLO is a novel cross-modality object detection network that effectively fuses RGB and infrared images by mutual enhancement modules, significantly improving detection in poor-illumination environments.
Contribution
The paper introduces DEYOLO, a dual-feature-enhancement network with novel modules for mutual feature enhancement and a bi-directional focus mechanism, advancing cross-modality object detection.
Findings
Outperforms state-of-the-art methods on M3FD and LLVIP datasets.
Effectively reduces interference between RGB and infrared modalities.
Enhances feature representation for better detection accuracy.
Abstract
Object detection in poor-illumination environments is a challenging task as objects are usually not clearly visible in RGB images. As infrared images provide additional clear edge information that complements RGB images, fusing RGB and infrared images has potential to enhance the detection ability in poor-illumination environments. However, existing works involving both visible and infrared images only focus on image fusion, instead of object detection. Moreover, they directly fuse the two kinds of image modalities, which ignores the mutual interference between them. To fuse the two modalities to maximize the advantages of cross-modality, we design a dual-enhancement-based cross-modality object detection network DEYOLO, in which semantic-spatial cross modality and novel bi-directional decoupled focus modules are designed to achieve the detection-centered mutual enhancement of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFocus
