TA3: Testing Against Adversarial Attacks on Machine Learning Models

Yuanzhe Jin; Min Chen

arXiv:2410.05334 · cs.CR · October 10, 2024



TL;DR

This paper introduces TA3, an interactive system that incorporates human-in-the-loop techniques to test decision tree models against adversarial attacks, enhancing evaluation and understanding of model robustness.

Contribution

The paper presents the design of TA3, a novel interactive system that integrates human expertise into adversarial testing workflows for machine learning models.

Findings

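01 HITL enables human-steered attack simulation and visualization-assisted evaluation of attack impact.

02 The current version of TA3 tests decision tree models against attacks based on the One Pixel Attack method.

03 The HITL approach could be extended to testing workflows for other types of ML models and other types of adversarial attacks.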

Abstract

Adversarial attacks are major threats to the deployment of machine learning (ML) models in many applications. Testing ML models against such attacks is becoming an essential step for evaluating and improving ML models. In this paper, we report the design and development of an interactive system for aiding the workflow of Testing Against Adversarial Attacks (TA3). In particular, with TA3, human-in-the-loop (HITL) enables human-steered attack simulation and visualization-assisted attack impact evaluation. While the current version of TA3 focuses on testing decision tree models against adversarial attacks based on the One Pixel Attack Method, it demonstrates the importance of HITL in ML testing and the potential application of HITL to the ML testing workflows for other types of ML models and other types of adversarial attacks.
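To make the idea of testing a decision tree against a one-pixel perturbation concrete, here is a minimal sketch. It is not TA3's implementation, which this page does not describe. The classifier `toy_tree_predict`, the 3x3 image, and the candidate pixel values are all hypothetical, and the search is a plain exhaustive scan rather than the differential evolution used in the original One Pixel Attack.

```python
from itertools import product


def toy_tree_predict(img):
    """Hypothetical hand-written decision tree over a 3x3 grayscale image (0-255)."""
    if img[1][1] > 127:                      # root split: is the centre pixel bright?
        return "bright" if img[0][0] > 63 else "mixed"
    return "dark"


def one_pixel_attack(predict, img, candidate_values=(0, 255)):
    """Try every single-pixel change and collect those that flip the label.

    This is an exhaustive search, chosen for clarity; it stands in for the
    evolutionary search of the original attack (an assumption of this sketch).
    """
    original = predict(img)
    successes = []
    for r, c in product(range(len(img)), range(len(img[0]))):
        for v in candidate_values:
            if v == img[r][c]:
                continue                     # not a perturbation
            perturbed = [row[:] for row in img]
            perturbed[r][c] = v              # change exactly one pixel
            label = predict(perturbed)
            if label != original:
                successes.append((r, c, v, label))
    return original, successes


# Hypothetical test image.
image = [
    [200, 10, 10],
    [10, 200, 10],
    [10, 10, 10],
]

original_label, flips = one_pixel_attack(toy_tree_predict, image)
print("original label:", original_label)
for row, col, value, new_label in flips:
    print(f"pixel ({row}, {col}) -> {value} changes label to {new_label}")
```

Only the pixels the tree actually splits on can flip the prediction in this toy case. A list of successful perturbations like this is the kind of result a human analyst could inspect or steer in a HITL workflow.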


Taxonomy

Topics: Adversarial Robustness in Machine Learning