Sharing FANCI Features: A Privacy Analysis of Feature Extraction for DGA Detection
Benedikt Holmes, Arthur Drichel, Ulrike Meyer

TL;DR
This paper evaluates the privacy risks of sharing features from FANCI, a DGA detection tool, finding that the feature extraction process offers strong privacy protection against domain name reconstruction attacks.
Contribution
The study provides an empirical and theoretical analysis demonstrating that FANCI's feature extractor resists inversion attacks, supporting privacy-preserving sharing of features.
Findings
Reconstruction attacks on FANCI features perform poorly.
Mathematical analysis supports the robustness of FANCI's feature extraction.
Sharing FANCI features does not significantly compromise privacy.
Abstract
The goal of Domain Generation Algorithm (DGA) detection is to recognize infections with bot malware and is often done with help of Machine Learning approaches that classify non-resolving Domain Name System (DNS) traffic and are trained on possibly sensitive data. In parallel, the rise of privacy research in the Machine Learning world leads to privacy-preserving measures that are tightly coupled with a deep learning model's architecture or training routine, while non deep learning approaches are commonly better suited for the application of privacy-enhancing methods outside the actual classification module. In this work, we aim to measure the privacy capability of the feature extractor of feature-based DGA detector FANCI (Feature-based Automated Nxdomain Classification and Intelligence). Our goal is to assess whether a data-rich adversary can learn an inverse mapping of FANCI's feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Network Security and Intrusion Detection · Privacy-Preserving Technologies in Data
