Video-Data Pipelines for Machine Learning Applications
Sohini Roychowdhury, James Y. Sato

TL;DR
This paper introduces an automated video-data pipeline framework that efficiently isolates high-quality, representative frames from video sequences, significantly reducing manual effort and enabling scalable ML deployment for autonomous driving applications.
Contribution
The work presents a novel automated pipeline for frame selection and tagging in video sequences, improving efficiency and scalability for machine learning data preparation.
Findings
Retains 0.1-20% of input frames based on quality and content.
Automates scene tagging and model verification within 30 seconds per sequence.
Scalable to larger video datasets for autonomous driving applications.
Abstract
Data pipelines are an essential component for end-to-end solutions that take machine learning algorithms to production. Engineering data pipelines for video-sequences poses several challenges including isolation of key-frames from video sequences that are high quality and represent significant variations in the scene. Manual isolation of such quality key-frames can take hours of sifting through hours worth of video data. In this work, we present a data pipeline framework that can automate this process of manual frame sifting in video sequences by controlling the fraction of frames that can be removed based on image quality and content type. Additionally, the frames that are retained can be automatically tagged per sequence, thereby simplifying the process of automated data retrieval for future ML model deployments. We analyze the performance of the proposed video-data pipeline for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Anomaly Detection Techniques and Applications
