YOGA: Deep Object Detection in the Wild with Lightweight Feature   Learning and Multiscale Attention

Raja Sunkara; Tie Luo

arXiv:2307.05945·cs.CV·July 13, 2023

YOGA: Deep Object Detection in the Wild with Lightweight Feature Learning and Multiscale Attention

Raja Sunkara, Tie Luo

PDF

TL;DR

YOGA is a lightweight, scalable object detection model that achieves a high accuracy-to-size ratio on edge devices by using efficient feature learning and attention mechanisms.

Contribution

The paper introduces YOGA, a novel lightweight object detector with a two-phase feature learning pipeline and multiscale attention, optimized for low-end edge devices.

Findings

01

YOGA achieves up to 22% higher AP compared to other models.

02

YOGA reduces parameters and FLOPs by 23-34%.

03

YOGA performs well on NVIDIA Jetson Nano.

Abstract

We introduce YOGA, a deep learning based yet lightweight object detection model that can operate on low-end edge devices while still achieving competitive accuracy. The YOGA architecture consists of a two-phase feature learning pipeline with a cheap linear transformation, which learns feature maps using only half of the convolution filters required by conventional convolutional neural networks. In addition, it performs multi-scale feature fusion in its neck using an attention mechanism instead of the naive concatenation used by conventional detectors. YOGA is a flexible model that can be easily scaled up or down by several orders of magnitude to fit a broad range of hardware constraints. We evaluate YOGA on COCO-val and COCO-testdev datasets with other over 10 state-of-the-art object detectors. The results show that YOGA strikes the best trade-off between model size and accuracy (up to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsConvolution