TL;DR
This paper introduces SATA, a semantics-aware traffic augmentation framework that enhances website fingerprinting models' generalization by aligning application semantics with observable traffic features, significantly improving performance in diverse scenarios.
Contribution
SATA is a novel framework that performs semantic traffic augmentation and cross-layer feature alignment to improve website fingerprinting robustness and generalization.
Findings
SATA improves accuracy by 90.81% in open-world settings.
SATA enhances AUROC by 48.37%.
Generated traffic patterns are more representative of real-world test data.
Abstract
Deep learning-based website fingerprinting has emerged as an effective technique for inferring the websites users visit. Although existing methods achieve strong performance on closed-world datasets, they often fail to generalize to real-world environments, especially under geographic and temporal shifts. This limitation fundamentally stems from the coupled effects of two key challenges: application-layer resource composition variability and observable feature instability induced by cross-layer encapsulation. Intertwined, these factors induce systematic shifts between underlying application semantics and observable traffic features. To address the above challenges, we propose SATA , a semantics-aware traffic augmentation framework. Specifically, SATA first performs application-layer semantic augmentation based on protocol rules, expanding the resource composition patterns within each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
