Exploring Adversarial Robustness in Classification tasks using DNA   Language Models

Hyunwoo Yoo; Haebin Shin; Kaidi Xu; Gail Rosen

arXiv:2409.19788·cs.CL·March 4, 2025

Exploring Adversarial Robustness in Classification tasks using DNA Language Models

Hyunwoo Yoo, Haebin Shin, Kaidi Xu, Gail Rosen

PDF

Open Access

TL;DR

This paper investigates the vulnerability of DNA language models to adversarial attacks at multiple levels, revealing significant susceptibility and demonstrating that adversarial training can improve their robustness and accuracy.

Contribution

It provides a comprehensive analysis of adversarial robustness in DNA language models and introduces adversarial training as an effective defense mechanism.

Findings

01

DNA models are highly susceptible to adversarial attacks

02

Adversarial training improves model robustness and accuracy

03

Model performance degrades significantly under attack

Abstract

DNA Language Models, such as GROVER, DNABERT2 and the Nucleotide Transformer, operate on DNA sequences that inherently contain sequencing errors, mutations, and laboratory-induced noise, which may significantly impact model performance. Despite the importance of this issue, the robustness of DNA language models remains largely underexplored. In this paper, we comprehensivly investigate their robustness in DNA classification by applying various adversarial attack strategies: the character (nucleotide substitutions), word (codon modifications), and sentence levels (back-translation-based transformations) to systematically analyze model vulnerabilities. Our results demonstrate that DNA language models are highly susceptible to adversarial attacks, leading to significant performance degradation. Furthermore, we explore adversarial training method as a defense mechanism, which enhances both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Machine Learning and Algorithms · Digital Media Forensic Detection

MethodsLinear Layer · Multi-Head Attention · Layer Normalization · Dense Connections · Attention Is All You Need · Adam · Residual Connection · Position-Wise Feed-Forward Layer · Label Smoothing · Byte Pair Encoding