TL;DR
This paper explores how to create adversarial input sequences that deceive recurrent neural networks, extending previous adversarial techniques from image classification to sequential data, demonstrating their effectiveness in misleading RNNs.
Contribution
It adapts existing adversarial sample crafting algorithms to recurrent neural networks and demonstrates their success on sequential data.
Findings
Adversarial sequences can mislead RNNs in classification tasks.
Existing algorithms for feed-forward networks can be adapted for RNNs.
Adversarial sequences are effective against both categorical and sequential RNNs.
Abstract
Machine learning models are frequently used to solve complex security problems, as well as to make decisions in sensitive situations like guiding autonomous vehicles or predicting financial market behaviors. Previous efforts have shown that numerous machine learning models were vulnerable to adversarial manipulations of their inputs taking the form of adversarial samples. Such inputs are crafted by adding carefully selected perturbations to legitimate inputs so as to force the machine learning model to misbehave, for instance by outputting a wrong class if the machine learning task of interest is classification. In fact, to the best of our knowledge, all previous work on adversarial samples crafting for neural network considered models used to solve classification tasks, most frequently in computer vision applications. In this paper, we contribute to the field of adversarial machine…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
