A Structured Analysis of the Video Degradation Effects on the Performance of a Machine Learning-enabled Pedestrian Detector
Christian Berger

TL;DR
This study systematically analyzes how various video degradation techniques affect the performance of a YOLO-based pedestrian detector, revealing that certain lossy compression settings can maintain performance while reducing data size.
Contribution
It provides a structured evaluation of video compression effects on pedestrian detection accuracy, highlighting configurations that balance performance and data efficiency.
Findings
Aggressive lossy compression significantly reduces detection performance.
Some compression settings slightly improve detection accuracy.
Carefully selected compression preserves detection performance with data savings.
Abstract
ML-enabled software systems have been incorporated in many public demonstrations for automated driving (AD) systems. Such solutions have also been considered as a crucial approach to aim at SAE Level 5 systems, where the passengers in such vehicles do not have to interact with the system at all anymore. Already in 2016, Nvidia demonstrated a complete end-to-end approach for training the complete software stack covering perception, planning and decision making, and the actual vehicle control. While such approaches show the great potential of such ML-enabled systems, there have also been demonstrations where already changes to single pixels in a video frame can potentially lead to completely different decisions with dangerous consequences. In this paper, a structured analysis has been conducted to explore video degradation effects on the performance of an ML-enabled pedestrian detector.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsYou Only Look Once
