STRATA: Simple, Gradient-Free Attacks for Models of Code

Jacob M. Springer; Bryn Marie Reinstadler; Una-May O'Reilly

arXiv:2009.13562·cs.LG·August 23, 2021·5 cites

STRATA: Simple, Gradient-Free Attacks for Models of Code

Jacob M. Springer, Bryn Marie Reinstadler, Una-May O'Reilly

PDF

Open Access

TL;DR

This paper introduces STRATA, a simple gradient-free method for generating adversarial examples on code models, exploiting token frequency and embedding relationships to achieve state-of-the-art results efficiently.

Contribution

The paper presents a novel, gradient-free approach for creating adversarial examples in code models, leveraging token frequency and embedding norms.

Findings

01

STRATA outperforms gradient-based methods in effectiveness.

02

The method requires less computational resources.

03

It maintains code functionality while fooling models.

Abstract

Neural networks are well-known to be vulnerable to imperceptible perturbations in the input, called adversarial examples, that result in misclassification. Generating adversarial examples for source code poses an additional challenge compared to the domains of images and natural language, because source code perturbations must retain the functional meaning of the code. We identify a striking relationship between token frequency statistics and learned token embeddings: the L2 norm of learned token embeddings increases with the frequency of the token except for the highest-frequnecy tokens. We leverage this relationship to construct a simple and efficient gradient-free method for generating state-of-the-art adversarial examples on models of code. Our method empirically outperforms competing gradient-based methods with less information and less computational effort.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Anomaly Detection Techniques and Applications