Privacy-Preserving Domain Adaptation of Semantic Parsers
Fatemehsadat Mireshghallah, Yu Su, Tatsunori Hashimoto, Jason Eisner,, Richard Shin

TL;DR
This paper introduces a differentially private method for generating synthetic user utterances to improve semantic parsers without compromising user privacy, enhancing coverage and accuracy in domain adaptation tasks.
Contribution
It presents a novel two-stage DP generation approach that improves semantic coverage and parser accuracy while preserving user privacy.
Findings
MAUVE score improved by 2.5×
Parse tree function type overlap increased by 1.3×
Achieved 8.5% points accuracy gain in domain adaptation
Abstract
Task-oriented dialogue systems often assist users with personal or confidential matters. For this reason, the developers of such a system are generally prohibited from observing actual usage. So how can they know where the system is failing and needs more training data or new functionality? In this work, we study ways in which realistic user utterances can be generated synthetically, to help increase the linguistic and functional coverage of the system, without compromising the privacy of actual users. To this end, we propose a two-stage Differentially Private (DP) generation method which first generates latent semantic parses, and then generates utterances based on the parses. Our proposed approach improves MAUVE by 2.5 and parse tree function type overlap by 1.3 relative to current approaches for private synthetic data generation, improving both on fluency and semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Privacy-Preserving Domain Adaptation of Semantic Parsers· youtube
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Topic Modeling
