Rate-Accuracy Trade-Off In Video Classification With Deep Convolutional Neural Networks
Mohammad Jubran, Alhabib Abbas, Aaron Chadha, Yiannis Andreopoulos

TL;DR
This paper explores how to reduce bandwidth in video classification by using compressed video data directly, achieving significant bitrate savings with minimal accuracy loss across different CNN models and datasets.
Contribution
It introduces a method to perform CNN-based video classification directly on compressed video data, reducing bandwidth without complex pre-processing.
Findings
Achieved 11%-94% bitrate reduction with minimal accuracy impact.
Using motion vectors and texture info at low bitrates enables effective classification.
Potential for classification at as low as 3 kbps with acceptable accuracy loss.
Abstract
Advanced video classification systems decode video frames to derive the necessary texture and motion representations for ingestion and analysis by spatio-temporal deep convolutional neural networks (CNNs). However, when considering visual Internet-of-Things applications, surveillance systems and semantic crawlers of large video repositories, the video capture and the CNN-based semantic analysis parts do not tend to be co-located. This necessitates the transport of compressed video over networks and incurs significant overhead in bandwidth and energy consumption, thereby significantly undermining the deployment potential of such systems. In this paper, we investigate the trade-off between the encoding bitrate and the achievable accuracy of CNN-based video classification models that directly ingest AVC/H.264 and HEVC encoded videos. Instead of retaining entire compressed video bitstreams…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Advanced Vision and Imaging · Anomaly Detection Techniques and Applications
