Understanding and Improving Usability of Data Dashboards for Simplified Privacy Control of Voice Assistant Data (Extended Version)
Vandit Sharma, Mainack Mondal

TL;DR
This study investigates Google Voice Assistant users' perceptions of data privacy, revealing limited awareness and discomfort sharing certain data, and proposes a classifier to enhance dashboard usability for privacy management.
Contribution
The paper provides a real-data-driven analysis of user privacy perceptions and introduces a sensitive data classifier to improve dashboard effectiveness.
Findings
Most users have superficial knowledge of GVA data collection.
Participants felt uncomfortable sharing 17.7% of GVA data.
A sensitive data classifier achieved 95% F1-score, improving dashboard recommendations.
Abstract
Today, intelligent voice assistant (VA) software like Amazon's Alexa, Google's Voice Assistant (GVA) and Apple's Siri have millions of users. These VAs often collect and analyze huge user data for improving their functionality. However, this collected data may contain sensitive information (e.g., personal voice recordings) that users might not feel comfortable sharing with others and might cause significant privacy concerns. To counter such concerns, service providers like Google present their users with a personal data dashboard (called `My Activity Dashboard'), allowing them to manage all voice assistant collected data. However, a real-world GVA-data driven understanding of user perceptions and preferences regarding this data (and data dashboards) remained relatively unexplored in prior research. To that end, in this work we focused on Google Voice Assistant (GVA) users and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Privacy-Preserving Technologies in Data · Data Stream Mining Techniques
