SqueezeDet: Unified, Small, Low Power Fully Convolutional Neural Networks for Real-Time Object Detection for Autonomous Driving

Bichen Wu; Alvin Wan; Forrest Iandola; Peter H. Jin; Kurt Keutzer

arXiv:1612.01051 · cs.CV · June 12, 2019 · 93 cites

TL;DR

SqueezeDet is a fully convolutional neural network designed for real-time, energy-efficient object detection in autonomous driving, achieving high accuracy with significantly smaller size and faster inference than previous models.

Contribution

The paper introduces SqueezeDet, a novel fully convolutional network that combines small size, high speed, and accuracy for autonomous vehicle object detection.

Findings

1. 30.4x smaller model size
2. 19.7x faster inference speed
3. 35.2x lower energy consumption

Abstract

Object detection is a crucial task for autonomous driving. In addition to requiring high accuracy to ensure safety, object detection for autonomous driving also requires real-time inference speed to guarantee prompt vehicle control, as well as small model size and energy efficiency to enable embedded system deployment. In this work, we propose SqueezeDet, a fully convolutional neural network for object detection that aims to simultaneously satisfy all of the above constraints. In our network, we use convolutional layers not only to extract feature maps but also as the output layer to compute bounding boxes and class probabilities. The detection pipeline of our model only contains a single forward pass of a neural network, thus it is extremely fast. Our model is fully-convolutional, which leads to a small model size and better energy efficiency. While achieving the same accuracy as…
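The idea of using a convolutional layer as the output layer can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. The channel layout (per anchor: class scores, one confidence score, four box offsets), the anchor count, and the names `conv2d_same` and `detection_head` are assumptions made for illustration and are not taken from the text above.

```python
import numpy as np


def conv2d_same(x, w, b):
    """Naive stride-1, zero-padded convolution (odd kernel sizes only).

    x: (H, W, C_in), w: (kh, kw, C_in, C_out), b: (C_out,)
    """
    kh, kw, _, c_out = w.shape
    ph, pw = kh // 2, kw // 2
    xp = np.pad(x, ((ph, ph), (pw, pw), (0, 0)))
    h, wd, _ = x.shape
    out = np.empty((h, wd, c_out))
    for i in range(h):
        for j in range(wd):
            patch = xp[i:i + kh, j:j + kw, :]
            # Contract the patch against every output filter at once.
            out[i, j] = np.tensordot(patch, w, axes=([0, 1, 2], [0, 1, 2])) + b
    return out


def detection_head(features, w, b, num_anchors, num_classes):
    """Hypothetical convolutional output layer.

    Assumed layout: each grid cell predicts, for each of `num_anchors`
    anchors, `num_classes` class scores, 1 confidence score, and 4 box
    offsets, so the layer needs num_anchors * (num_classes + 5) filters.
    """
    h, wd, _ = features.shape
    raw = conv2d_same(features, w, b)
    raw = raw.reshape(h, wd, num_anchors, num_classes + 5)

    # Softmax over class scores (shifted by the max for numerical stability).
    logits = raw[..., :num_classes]
    e = np.exp(logits - logits.max(axis=-1, keepdims=True))
    class_probs = e / e.sum(axis=-1, keepdims=True)

    # Sigmoid squashes the confidence score into (0, 1).
    confidence = 1.0 / (1.0 + np.exp(-raw[..., num_classes]))

    # Remaining four channels are the raw box offsets.
    box_deltas = raw[..., num_classes + 1:]
    return class_probs, confidence, box_deltas


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feats = rng.normal(size=(4, 6, 8))  # stand-in for a backbone feature map
    K, C = 3, 2                         # illustrative anchor and class counts
    w = rng.normal(size=(3, 3, 8, K * (C + 5))) * 0.1
    b = np.zeros(K * (C + 5))
    probs, conf, deltas = detection_head(feats, w, b, K, C)
    print(probs.shape, conf.shape, deltas.shape)
```

Because every prediction comes from one convolution over the feature map, the whole detector runs as a single forward pass, and the output layer's parameter count depends on the kernel size and channel counts rather than on the spatial size of the feature map.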

Taxonomy

Topics: Advanced Neural Network Applications · Autonomous Vehicle Technology and Safety · Video Surveillance and Tracking Methods

Methods: SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings