CAF-YOLO: A Robust Framework for Multi-Scale Lesion Detection in Biomedical Imagery
Zilin Chen, Shengnan Lu

TL;DR
CAF-YOLO is a novel multi-scale lesion detection framework combining CNNs and transformers, enhancing the detection of tiny biomedical entities with improved global and local feature modeling.
Contribution
The paper introduces CAF-YOLO, integrating an attention-convolution fusion module and multi-scale neural network to improve detection accuracy of minute lesions in biomedical images.
Findings
Outperforms existing methods on BCCD and LUNA16 datasets.
Effectively detects micro-lesions with high precision.
Enhances global and local feature interaction in biomedical object detection.
Abstract
Object detection is of paramount importance in biomedical image analysis, particularly for lesion identification. While current methodologies are proficient in identifying and pinpointing lesions, they often lack the precision needed to detect minute biomedical entities (e.g., abnormal cells, lung nodules smaller than 3 mm), which are critical in blood and lung pathology. To address this challenge, we propose CAF-YOLO, based on the YOLOv8 architecture, a nimble yet robust method for medical object detection that leverages the strengths of convolutional neural networks (CNNs) and transformers. To overcome the limitation of convolutional kernels, which have a constrained capacity to interact with distant information, we introduce an attention and convolution fusion module (ACFM). This module enhances the modeling of both global and local features, enabling the capture of long-term feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBrain Tumor Detection and Classification · AI in cancer detection
MethodsSoftmax · Attention Is All You Need · Convolution · You Only Look Once
