More Than Meets the Eye: A Semantics-Aware Traffic Augmentation Framework for Generalizable Website Fingerprinting

Youquan Xian; Xueying Zeng; Lingjia Meng; Lei Cui; Runhan Song; Wei Wang; Zhengquan Ding; Peng Liu; Zhiyu Hao

arXiv:2605.11402·cs.LG·May 13, 2026

TL;DR

This paper introduces SATA, a semantics-aware traffic augmentation framework that improves the generalization of website fingerprinting models by aligning application-layer semantics with observable traffic features, targeting the geographic and temporal shifts under which existing models degrade.

Contribution

SATA is a novel framework that performs semantic traffic augmentation and cross-layer feature alignment to improve website fingerprinting robustness and generalization.

Findings

01. SATA improves accuracy by 90.81% in open-world settings.

02. SATA enhances AUROC by 48.37%.

03. Generated traffic patterns are more representative of real-world test data.

Abstract

Deep learning-based website fingerprinting has emerged as an effective technique for inferring the websites users visit. Although existing methods achieve strong performance on closed-world datasets, they often fail to generalize to real-world environments, especially under geographic and temporal shifts. This limitation fundamentally stems from the coupled effects of two key challenges: application-layer resource composition variability and observable feature instability induced by cross-layer encapsulation. Intertwined, these factors induce systematic shifts between underlying application semantics and observable traffic features. To address these challenges, we propose SATA, a semantics-aware traffic augmentation framework. Specifically, SATA first performs application-layer semantic augmentation based on protocol rules, expanding the resource composition patterns within each…
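To make the idea of expanding resource composition patterns concrete, here is a minimal sketch of one possible reading. It is not SATA's method: the `Resource` type, the `cacheable` flag, the `drop_prob` parameter, and the three rules in the docstring are all hypothetical choices made for illustration only.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Resource:
    name: str
    size: int        # response body size in bytes
    cacheable: bool  # hypothetical flag: may be served from the browser cache


def augment_page_load(resources, rng, drop_prob=0.5):
    """Return one synthetic variant of a page load.

    Hypothetical rules (assumptions, not taken from the paper):
    - the first resource (the main document) is always fetched first;
    - each cacheable sub-resource is skipped with probability drop_prob;
    - the remaining sub-resources may arrive in any order.
    """
    if not resources:
        return []
    main, subs = resources[0], list(resources[1:])
    kept = [r for r in subs if not (r.cacheable and rng.random() < drop_prob)]
    rng.shuffle(kept)
    return [main] + kept


# Toy page load; names and sizes are made up.
page = [
    Resource("index.html", 14_000, False),
    Resource("app.js", 220_000, True),
    Resource("style.css", 35_000, True),
    Resource("api/feed.json", 8_000, False),
]

rng = random.Random(0)  # fixed seed so runs are repeatable
variants = [augment_page_load(page, rng) for _ in range(5)]
for v in variants:
    print([r.name for r in v], sum(r.size for r in v))
```

Each variant keeps the main document and all non-cacheable resources while varying which cacheable resources appear and in what order, so the same website label covers several plausible compositions. Mapping such variants to observable packet-level features is a separate step that this sketch does not attempt.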

Code & Models

Repositories

https://anonymous.4open.science/r/SATA-B6C2