Superstring-Based Sequence Obfuscation to Thwart Pattern Matching Attacks
Bo Guan, Nazanin Takbiri, Dennis Goeckel, Amir Houmansadr, Hossein, Pishro-Nik

TL;DR
This paper introduces a novel sequence obfuscation method using superstring-based algorithms to prevent pattern matching attacks, enhancing user privacy without relying on statistical models.
Contribution
It presents data-independent algorithms for sequence obfuscation with provable guarantees and also explores data-dependent approaches, validated on synthetic and real datasets.
Findings
Effective obfuscation of user data traces
Provable guarantees for sequence construction
Successful evaluation on synthetic and real data
Abstract
User privacy can be compromised by matching user data traces to records of their previous behavior. The matching of the statistical characteristics of traces to prior user behavior has been widely studied. However, an adversary can also identify a user deterministically by searching data traces for a pattern that is unique to that user. Our goal is to thwart such an adversary by applying small artificial distortions to data traces such that each potentially identifying pattern is shared by a large number of users. Importantly, in contrast to statistical approaches, we develop data-independent algorithms that require no assumptions on the model by which the traces are generated. By relating the problem to a set of combinatorial questions on sequence construction, we are able to provide provable guarantees for our proposed constructions. We also introduce data-dependent approaches for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Internet Traffic Analysis and Secure E-voting · Digital and Cyber Forensics
