QYOLO: Lightweight Object Detection via Quantum Inspired Shared Channel Mixing

Garvit Kumar Mittal; Sahil Tomar; Sandeep Kumar

arXiv:2604.26435·cs.CV·April 30, 2026

QYOLO: Lightweight Object Detection via Quantum Inspired Shared Channel Mixing

Garvit Kumar Mittal, Sahil Tomar, Sandeep Kumar

PDF

TL;DR

QYOLO introduces a quantum-inspired channel mixing method that significantly reduces model size and computation in object detection while maintaining accuracy, enabling more efficient real-time applications.

Contribution

This work presents a novel quantum-inspired shared channel mixing framework replacing deep backbone modules, achieving architectural compression with minimal accuracy loss.

Findings

01

QYOLOv8n reduces parameters by 20.2% with 0.4 pp mAP@50 degradation.

02

QYOLOv8s achieves 21.8% parameter reduction with 0.1 pp mAP@50 degradation.

03

Combined with distillation, full accuracy parity is recovered at no additional cost.

Abstract

The rapid advancement of object detection architectures has positioned single stage detectors as the dominant solution for real-time visual perception. A primary source of computational overhead in these models lies in the deep backbone stages, where C2f bottleneck modules at high stride levels accumulate a disproportionate share of parameters due to quadratic scaling with channel width. This work introduces QYOLO, a quantum-inspired channel mixing framework that achieves genuine architectural compression by replacing the two deepest backbone C2f modules at P4/16 (512 channels) and P5/32 (1024 channels) with a compact QMixBlock. The proposed block performs global channel recalibration through a sinusoidal mixing mechanism with shared learnable parameters across both backbone stages, enforcing consistent channel importance without requiring independent per-stage parameter sets. The neck…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.