Octave-YOLO: Cross frequency detection network with octave convolution
Sangjune Shin, Dongkun Shin

TL;DR
Octave-YOLO introduces a cross frequency detection network that efficiently processes high-resolution images in real-time on embedded devices by dividing features into frequency-based sections, reducing computation while maintaining accuracy.
Contribution
The paper proposes Octave-YOLO with CFPNet, enabling real-time high-resolution object detection on embedded systems through frequency-based feature separation and efficient convolution techniques.
Findings
Matches YOLOv8 performance with 40% fewer parameters
1.56x faster than YOLOv8 at 1080x1080 resolution
Significantly reduces computational demands while maintaining accuracy
Abstract
Despite the rapid advancement of object detection algorithms, processing high-resolution images on embedded devices remains a significant challenge. Theoretically, the fully convolutional network architecture used in current real-time object detectors can handle all input resolutions. However, the substantial computational demands required to process high-resolution images render them impractical for real-time applications. To address this issue, real-time object detection models typically downsample the input image for inference, leading to a loss of detail and decreased accuracy. In response, we developed Octave-YOLO, designed to process high-resolution images in real-time within the constraints of embedded systems. We achieved this through the introduction of the cross frequency partial network (CFPNet), which divides the input feature map into low-resolution, low-frequency, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBlind Source Separation Techniques
MethodsSoftmax · Attention Is All You Need · Convolution · Pointwise Convolution · Depthwise Convolution · Depthwise Separable Convolution · You Only Look Once
