DEYOLO: Dual-Feature-Enhancement YOLO for Cross-Modality Object   Detection

Yishuo Chen; Boran Wang; Xinyu Guo; Wenbin Zhu; Jiasheng He; Xiaobin; Liu; Jing Yuan

arXiv:2412.04931·cs.CV·December 9, 2024

DEYOLO: Dual-Feature-Enhancement YOLO for Cross-Modality Object Detection

Yishuo Chen, Boran Wang, Xinyu Guo, Wenbin Zhu, Jiasheng He, Xiaobin, Liu, Jing Yuan

PDF

1 Repo

TL;DR

DEYOLO is a novel cross-modality object detection network that effectively fuses RGB and infrared images by mutual enhancement modules, significantly improving detection in poor-illumination environments.

Contribution

The paper introduces DEYOLO, a dual-feature-enhancement network with novel modules for mutual feature enhancement and a bi-directional focus mechanism, advancing cross-modality object detection.

Findings

01

Outperforms state-of-the-art methods on M3FD and LLVIP datasets.

02

Effectively reduces interference between RGB and infrared modalities.

03

Enhances feature representation for better detection accuracy.

Abstract

Object detection in poor-illumination environments is a challenging task as objects are usually not clearly visible in RGB images. As infrared images provide additional clear edge information that complements RGB images, fusing RGB and infrared images has potential to enhance the detection ability in poor-illumination environments. However, existing works involving both visible and infrared images only focus on image fusion, instead of object detection. Moreover, they directly fuse the two kinds of image modalities, which ignores the mutual interference between them. To fuse the two modalities to maximize the advantages of cross-modality, we design a dual-enhancement-based cross-modality object detection network DEYOLO, in which semantic-spatial cross modality and novel bi-directional decoupled focus modules are designed to achieve the detection-centered mutual enhancement of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chips96/deyolo
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFocus