FrameHopper: Selective Processing of Video Frames in Detection-driven Real-Time Video Analytics
Md Adnan Arefeen, Sumaiya Tabassum Nimi, and Md Yusuf Sarwar Uddin

TL;DR
FrameHopper is a novel framework that intelligently skips redundant video frames using reinforcement learning, significantly reducing computational costs while maintaining detection accuracy in real-time video analytics on edge devices.
Contribution
It introduces a new error-rate optimization approach and an RL-based method for adaptive frame skipping in detection-driven video analytics, improving efficiency and accuracy.
Findings
Reduces frame processing by up to 90% while maintaining detection quality.
Outperforms recent state-of-the-art methods in real-life scenarios.
Achieves near-oracle detection performance with fewer processed frames.
Abstract
Detection-driven real-time video analytics require continuous detection of objects contained in the video frames using deep learning models like YOLOV3, EfficientDet. However, running these detectors on each and every frame in resource-constrained edge devices is computationally intensive. By taking the temporal correlation between consecutive video frames into account, we note that detection outputs tend to be overlapping in successive frames. Elimination of similar consecutive frames will lead to a negligible drop in performance while offering significant performance benefits by reducing overall computation and communication costs. The key technical questions are, therefore, (a) how to identify which frames to be processed by the object detector, and (b) how many successive frames can be skipped (called skip-length) once a frame is selected to be processed. The overall goal of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Neural Network Applications · Video Surveillance and Tracking Methods
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Pointwise Convolution · Depthwise Convolution · Depthwise Separable Convolution · Batch Normalization · BiFPN · EfficientDet
