TL;DR
Scanner is a system that enables efficient, scalable, and productive analysis of large video datasets by organizing videos as tables, executing pixel processing on heterogeneous hardware, and supporting complex applications that run from minutes to hours.
Contribution
The paper introduces Scanner, a novel system that simplifies and accelerates large-scale video analysis through optimized data organization, hardware scheduling, and support for diverse applications.
Findings
Supports analysis of hundreds of feature-length films and 70,000 hours of TV news.
Achieves near-expert performance on a single machine.
Scales efficiently to hundreds of machines, reducing analysis time from days to hours.
Abstract
A growing number of visual computing applications depend on the analysis of large video collections. The challenge is that scaling applications to operate on these datasets requires efficient systems for pixel data access and parallel processing across large numbers of machines. Few programmers have the capability to operate efficiently at these scales, limiting the field's ability to explore new applications that leverage big video data. In response, we have created Scanner, a system for productive and efficient video analysis at scale. Scanner organizes video collections as tables in a data store optimized for sampling frames from compressed video, and executes pixel processing computations, expressed as dataflow graphs, on these frames. Scanner schedules video analysis applications expressed using these abstractions onto heterogeneous throughput computing hardware, such as multi-core…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
