TL;DR
PhyAug leverages physical principles to generate augmented training data, significantly improving deep sensing model transferability across domains with minimal target data, demonstrated in speech recognition and seismic localization.
Contribution
It introduces a physics-based data augmentation method that reduces target domain data requirements for transfer learning in sensing systems.
Findings
Achieves 37% to 72% accuracy recovery in speech recognition.
Requires only 3% to 8% of labeled data for seismic localization.
Effectively mitigates domain shifts using physics-informed data augmentation.
Abstract
Run-time domain shifts from training-phase domains are common in sensing systems designed with deep learning. The shifts can be caused by sensor characteristic variations and/or discrepancies between the design-phase model and the actual model of the sensed physical process. To address these issues, existing transfer learning techniques require substantial target-domain data and thus incur high post-deployment overhead. This paper proposes to exploit the first principle governing the domain shift to reduce the demand on target-domain data. Specifically, our proposed approach called PhyAug uses the first principle fitted with few labeled or unlabeled source/target-domain data pairs to transform the existing source-domain training data into augmented data for updating the deep neural networks. In two case studies of keyword spotting and DeepSpeech2-based automatic speech recognition, with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
