# Unintended Creation or Insertion of Antisense Promoter Motifs During Codon Optimization: A Cyber-Biosecurity Risk

**Authors:** Elad Carmi, Roni Glikman, Yuval Dorfan

PMC · DOI: 10.3390/microorganisms14030638 · Microorganisms · 2026-03-12

## TL;DR

This paper reveals that codon optimization can accidentally create harmful antisense promoter motifs, posing a cyber-biosecurity risk in synthetic biology.

## Contribution

The study introduces a computational pipeline to detect and prevent unintended antisense promoter insertions during codon optimization.

## Key findings

- Only 4.8% of sequences naturally contained antisense promoter motifs, but 77.28% of motif-free sequences allowed silent insertions.
- A computational pipeline was developed to scan and prevent unintended antisense motif insertions in DNA sequences.
- The findings highlight a cyber-biosecurity vulnerability in DNA design pipelines requiring bi-directional screening.

## Abstract

Codon optimization is a cornerstone technique in synthetic biology and biotechnological production, aimed at enhancing heterologous protein expression through synonymous codon substitutions. While optimization traditionally focuses on forward-strand translation efficiency, its impact on the complementary DNA strand is not always carefully examined. In this study, we investigate whether codon optimization inadvertently introduces antisense motifs, specifically bacterial antisense promoter (e.g., “TATAAT”), and whether such motifs can be silently inserted into coding sequences on purpose without altering protein output. We developed a computational pipeline that (i) scans optimized sequences for antisense motifs. These could be either natural or synthetic unintended motifs; (ii) implements a silent insertion algorithm that preserves amino acid sequence; and (iii) evaluates insertion feasibility across a large genomic dataset. These components can also lead to useful scanning of synthetic sequences, before they are synthesized or ordered. It has the potential to save a great deal of time and money that might be spent in wet labs that are using the wrong sequences. Their experiments often fail due to predictable reasons, while these failures can be avoided using the software (SW) we developed, which is published here as an open source for academic and industrial usage. In a dataset of 484,741 protein-coding sequences, only 4.8% naturally contained the motif, yet 77.28% of motif-free sequences permitted silent insertions. We extend these findings with codon bias analysis, derive analytical bounds for insertion complexity, and propose computational defense strategies. These results uncover a novel cyber-biosecurity vulnerability in DNA design pipelines, emphasizing the need for bi-directional screening in codon optimization tools.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13029128/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC13029128/full.md

## References

29 references — full list in the complete paper: https://tomesphere.com/paper/PMC13029128/full.md

---
Source: https://tomesphere.com/paper/PMC13029128