Adversarial Edit Attacks for Tree Data

Benjamin Paa{\ss}en

arXiv:1908.09364·cs.LG·November 12, 2019

Adversarial Edit Attacks for Tree Data

Benjamin Paa{\ss}en

PDF

TL;DR

This paper introduces a novel black-box adversarial attack method for tree-structured data, demonstrating its effectiveness on machine learning models in medicine and program analysis without requiring gradient information.

Contribution

It extends adversarial attack techniques to tree data using tree edit distance and black-box queries, a novel approach in this domain.

Findings

01

Effective attacks on tree classifiers like tree-kernel-SVMs and recursive neural networks.

02

Requires only logarithmic black-box queries and no gradient information.

03

Applicable to biomedical and programming data sets.

Abstract

Many machine learning models can be attacked with adversarial examples, i.e. inputs close to correctly classified examples that are classified incorrectly. However, most research on adversarial attacks to date is limited to vectorial data, in particular image data. In this contribution, we extend the field by introducing adversarial edit attacks for tree-structured data with potential applications in medicine and automated program analysis. Our approach solely relies on the tree edit distance and a logarithmic number of black-box queries to the attacked classifier without any need for gradient information. We evaluate our approach on two programming and two biomedical data sets and show that many established tree classifiers, like tree-kernel-SVMs and recursive neural networks, can be attacked effectively.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.