Large-Scale Video Analytics through Object-Level Consolidation
Daniel Rivas, Francesc Guim, Jord\`a Polo, David Carrera

TL;DR
This paper introduces FoMO, a method that enhances large-scale video analytics by consolidating object regions from multiple cameras, significantly improving efficiency and accuracy without additional training.
Contribution
FoMO is a novel approach that preprocesses and consolidates camera feeds to optimize object detection in distributed video analytics systems.
Findings
8x increase in system performance
40% improvement in detection accuracy
No additional training required
Abstract
As the number of installed cameras grows, so do the compute resources required to process and analyze all the images captured by these cameras. Video analytics enables new use cases, such as smart cities or autonomous driving. At the same time, it urges service providers to install additional compute resources to cope with the demand while the strict latency requirements push compute towards the end of the network, forming a geographically distributed and heterogeneous set of compute locations, shared and resource-constrained. Such landscape (shared and distributed locations) forces us to design new techniques that can optimize and distribute work among all available locations and, ideally, make compute requirements grow sublinearly with respect to the number of cameras installed. In this paper, we present FoMO (Focus on Moving Objects). This method effectively optimizes multi-camera…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques
Methodstravel james
