# An Energy and GPU-Computation Efficient Backbone Network for Real-Time   Object Detection

**Authors:** Youngwan Lee, Joong-won Hwang, Sangrok Lee, Yuseok Bae, Jongyoul Park

arXiv: 1904.09730 · 2019-04-23

## TL;DR

This paper introduces VoVNet, an efficient backbone network for real-time object detection that improves speed and energy efficiency over DenseNet and ResNet by aggregating features only once.

## Contribution

The paper proposes VoVNet with One-Shot Aggregation to enhance feature reuse efficiency, reducing energy consumption and computation overhead in object detection models.

## Key findings

- VoVNet outperforms DenseNet with 2x faster speed and 1.6x-4.1x less energy consumption.
- VoVNet surpasses ResNet in speed and energy efficiency.
- Significant improvement in small object detection performance.

## Abstract

As DenseNet conserves intermediate features with diverse receptive fields by aggregating them with dense connection, it shows good performance on the object detection task. Although feature reuse enables DenseNet to produce strong features with a small number of model parameters and FLOPs, the detector with DenseNet backbone shows rather slow speed and low energy efficiency. We find the linearly increasing input channel by dense connection leads to heavy memory access cost, which causes computation overhead and more energy consumption. To solve the inefficiency of DenseNet, we propose an energy and computation efficient architecture called VoVNet comprised of One-Shot Aggregation (OSA). The OSA not only adopts the strength of DenseNet that represents diversified features with multi receptive fields but also overcomes the inefficiency of dense connection by aggregating all features only once in the last feature maps. To validate the effectiveness of VoVNet as a backbone network, we design both lightweight and large-scale VoVNet and apply them to one-stage and two-stage object detectors. Our VoVNet based detectors outperform DenseNet based ones with 2x faster speed and the energy consumptions are reduced by 1.6x - 4.1x. In addition to DenseNet, VoVNet also outperforms widely used ResNet backbone with faster speed and better energy efficiency. In particular, the small object detection performance has been significantly improved over DenseNet and ResNet.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.09730/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/1904.09730/full.md

## References

32 references — full list in the complete paper: https://tomesphere.com/paper/1904.09730/full.md

---
Source: https://tomesphere.com/paper/1904.09730