Can Offline Testing of Deep Neural Networks Replace Their Online Testing?
Fitash Ul Haq, Donghwan Shin, Shiva Nejati, Lionel Briand

TL;DR
This paper investigates whether offline testing of deep neural networks can replace online testing in autonomous driving, finding offline testing less effective for safety violations and not a cost-effective substitute.
Contribution
It provides an empirical analysis of the relationship between offline and online testing for DNNs in autonomous driving, highlighting limitations of offline testing as a replacement.
Findings
Offline testing misses many safety violations detected by online testing.
Large prediction errors in offline testing correlate with severe safety violations.
Offline testing cannot reliably reduce online testing costs in practice.
Abstract
We distinguish two general modes of testing for Deep Neural Networks (DNNs): Offline testing where DNNs are tested as individual units based on test datasets obtained without involving the DNNs under test, and online testing where DNNs are embedded into a specific application environment and tested in a closed-loop mode in interaction with the application environment. Typically, DNNs are subjected to both types of testing during their development life cycle where offline testing is applied immediately after DNN training and online testing follows after offline testing and once a DNN is deployed within a specific application environment. In this paper, we study the relationship between offline and online testing. Our goal is to determine how offline testing and online testing differ or complement one another and if offline testing results can be used to help reduce the cost of online…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
