Mining Top-k Trajectory-Patterns from Anonymized Data
Anuj S. Saxena, Siddharth Dawar, Vikram Goyal, Debajyoti Bera

TL;DR
This paper introduces TopKMintra, a novel method for mining top-k activity-trajectory patterns from anonymized GPS data, overcoming challenges posed by privacy-induced data transformations and ensuring efficient pattern discovery.
Contribution
It proposes a new pattern mining technique that transforms anonymized trajectory data into an intermediate form and restricts pattern extensions to handle 2D data complexities.
Findings
Efficiently mines patterns from anonymized data.
Patterns can be retained and discovered despite anonymization.
Method outperforms existing techniques in effectiveness and efficiency.
Abstract
The ubiquity of GPS enabled devices result into the generation of an enormous amount of user movement data consisting of a sequence of locations together with activities performed at those locations. Such data, commonly known as {\it activity-trajectory data}, can be analysed to find common user movements and preferred activities, which will have tremendous business value. However, various countries have now introduced data privacy regulations that make it mandatory to anonymize any data before releasing it. This makes it difficult to look for patterns as the existing mining techniques may not be directly applicable over anonymized data. User locations in an activity-trajectory dataset are anonymized to regions of different shapes and sizes making them uncomparable; therefore, it is unclear how to define suitable patterns over those regions. In this paper, we propose a top-k pattern…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Privacy-Preserving Technologies in Data · Internet Traffic Analysis and Secure E-voting
