Comparing Offline and Online Testing of Deep Neural Networks: An Autonomous Car Case Study
Fitash Ul Haq, Donghwan Shin, Shiva Nejati, Lionel Briand

TL;DR
This study compares offline and online testing modes for DNNs in autonomous driving, evaluating the reliability of simulator data and the differences in safety violation detection between testing approaches.
Contribution
It provides an empirical comparison of offline and online testing for DNNs in autonomous vehicles, highlighting the effectiveness of simulator data and differences in safety violation detection.
Findings
Simulator data yields similar prediction errors to real-world data.
Offline testing is more optimistic than online testing in safety violation detection.
Online testing detects safety violations not found in offline testing.
Abstract
There is a growing body of research on developing testing techniques for Deep Neural Networks (DNN). We distinguish two general modes of testing for DNNs: Offline testing where DNNs are tested as individual units based on test datasets obtained independently from the DNNs under test, and online testing where DNNs are embedded into a specific application and tested in a close-loop mode in interaction with the application environment. In addition, we identify two sources for generating test datasets for DNNs: Datasets obtained from real-life and datasets generated by simulators. While offline testing can be used with datasets obtained from either sources, online testing is largely confined to using simulators since online testing within real-life applications can be time-consuming, expensive and dangerous. In this paper, we study the following two important questions aiming to compare…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTest
