# Fast object detection in compressed JPEG Images

**Authors:** Benjamin Deguerre, Cl\'ement Chatelain, Gilles Gasso

arXiv: 1904.08408 · 2020-06-23

## TL;DR

This paper introduces a novel deep learning architecture that detects objects directly from JPEG compressed images using DCT coefficients, significantly reducing processing time compared to traditional methods that require decompression.

## Contribution

It presents the first method for object detection directly on JPEG compressed images by modifying SSD to process DCT coefficients, achieving faster detection without decompression.

## Key findings

- Model is approximately 2 times faster than standard SSD.
- Achieves promising detection performance on benchmark and industrial datasets.
- First approach to directly detect objects in JPEG compressed images.

## Abstract

Object detection in still images has drawn a lot of attention over past few years, and with the advent of Deep Learning impressive performances have been achieved with numerous industrial applications. Most of these deep learning models rely on RGB images to localize and identify objects in the image. However in some application scenarii, images are compressed either for storage savings or fast transmission. Therefore a time consuming image decompression step is compulsory in order to apply the aforementioned deep models. To alleviate this drawback, we propose a fast deep architecture for object detection in JPEG images, one of the most widespread compression format. We train a neural network to detect objects based on the blockwise DCT (discrete cosine transform) coefficients {issued from} the JPEG compression algorithm. We modify the well-known Single Shot multibox Detector (SSD) by replacing its first layers with one convolutional layer dedicated to process the DCT inputs. Experimental evaluations on PASCAL VOC and industrial dataset comprising images of road traffic surveillance show that the model is about $2\times$ faster than regular SSD with promising detection performances. To the best of our knowledge, this paper is the first to address detection in compressed JPEG images.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.08408/full.md

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/1904.08408/full.md

## References

23 references — full list in the complete paper: https://tomesphere.com/paper/1904.08408/full.md

---
Source: https://tomesphere.com/paper/1904.08408