SEA: Shareable and Explainable Attribution for Query-based Black-box Attacks
Yue Gao, Ilia Shumailov, Kassem Fawaz

TL;DR
SEA is a security system that attributes query-based black-box attacks on ML models using Hidden Markov Models, enabling forensic analysis and explainable threat intelligence sharing, effective even against adaptive attackers.
Contribution
Introduces SEA, a novel system that characterizes black-box attacks for forensic purposes and facilitates human-explainable sharing of attack information using Hidden Markov Models.
Findings
SEA achieves over 90% Top-1 attack attribution accuracy.
SEA is robust to adaptive attack strategies.
It can identify specific bugs in attack libraries.
Abstract
Machine Learning (ML) systems are vulnerable to adversarial examples, particularly those from query-based black-box attacks. Despite various efforts to detect and prevent such attacks, ML systems are still at risk, demanding a more comprehensive approach to security that includes logging, analyzing, and sharing evidence. While traditional security benefits from well-established practices of forensics and threat intelligence sharing, ML security has yet to find a way to profile its attackers and share information about them. In response, this paper introduces SEA, a novel ML security system to characterize black-box attacks on ML systems for forensic purposes and to facilitate human-explainable intelligence sharing. SEA leverages Hidden Markov Models to attribute the observed query sequence to known attacks. It thus understands the attack's progression rather than focusing solely on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Digital and Cyber Forensics · Anomaly Detection Techniques and Applications
