# Off-target sequence variations driven by the intrinsic properties of the Cas–sgRNA–DNA complex in genome editing

**Authors:** Celine Kurniawan, Takeshi Itoh, Hodaka Fujii, Hodaka Fujii, Hodaka Fujii, Hodaka Fujii

PMC · DOI: 10.1371/journal.pone.0328905 · 2025-07-18

## TL;DR

This study explores how off-target mutations in genome editing are influenced by the Cas–sgRNA–DNA complex's intrinsic properties, aiming to improve prediction tools.

## Contribution

The study introduces a new method to assess sequence pattern similarity and diversity in off-target mutations using relative entropy.

## Key findings

- Off-target sequence patterns are similar across different experimental conditions, indicating intrinsic Cas–sgRNA–DNA properties.
- Newly engineered enzymes and bacterial sources can produce unique off-target patterns.
- Three methods showed clear correlations, validating their use in analyzing off-target mutations.

## Abstract

Genome-editing technologies hold significant potential across various biotechnological fields, yet concerns about possible risks, including off-target mutations, remain. To ensure safe and effective application, these unintended mutations must be rigorously examined and minimized. Computational approaches are anticipated to streamline the detection of off-target mutations; however, the performance of current prediction tools is limited, likely owing to insufficient knowledge of off-target mutation characteristics. In this study, we collected experimentally validated off-target mutation data and conducted a large-scale analysis of 177 nonredundant datasets obtained from six studies. We developed a method to assess the statistical significance of sequence pattern similarity and diversity between off-target sites. This method is based on a comparison of ordered relative entropy values for aligned target sequences, and it was compared with two other methods on the basis of Euclidean distance and the Pearson correlation coefficient. The three methods demonstrated clear correlations, indicating their validity. These methods were applied to 238 dataset pairs for the same target site, and it was revealed that off-target sequence patterns were quite similar across different experimental conditions, such as varying cell lines and independent experiments, suggesting that the intrinsic properties of the Cas–sgRNA–DNA complex play a key role in determining cleavage sites. However, newly engineered enzymes and those from different bacterial sources occasionally display unique off-target patterns, indicating the need for comprehensive evaluation of each new enzyme to develop reliable prediction tools. The insights gained from this study are expected to contribute to a better understanding of off-target mutation characteristics and support the development of more accurate computational prediction methods.

## Linked entities

- **Proteins:** CSE1L (chromosome segregation 1 like)

## Full-text entities

- **Genes:** HBB (hemoglobin subunit beta) [NCBI Gene 3043] {aka CD113t-C, ECYT6, beta-globin}, FANCF (FA complementation group F) [NCBI Gene 2188] {aka FAF}, BCAR1 (BCAR1 scaffold protein, Cas family member) [NCBI Gene 9564] {aka CAS, CAS1, CASS1, CRKAS, P130Cas}, CCR5 (C-C motif chemokine receptor 5) [NCBI Gene 1234] {aka CC-CKR-5, CCCKR5, CCR-5, CD195, CKR-5, CKR5}
- **Diseases:** leukemia (MESH:D007938), cardiovascular diseases (MESH:D002318), cancer (MESH:D009369), toxicities (MESH:D064420), WT (MESH:D006969), sickle cell anemia (MESH:D000755), SLIDINGWINDOW:10:20 (MESH:C557827), neurodegenerative disorders (MESH:D019636)
- **Chemicals:** Cas12a (-), gamma-aminobutyric acid (MESH:D005680)
- **Species:** Bos taurus (bovine, species) [taxon 9913], Homo sapiens (human, species) [taxon 9606]
- **Mutations:** CCAGAAGCT with TTTTTTTTA
- **Cell lines:** HEKsite4 — Homo sapiens (Human), Ataxia telangiectasia syndrome, Finite cell line (CVCL_F083), U2OS — Homo sapiens (Human), Osteosarcoma, Cancer cell line (CVCL_0042), HEK293site2 — Homo sapiens (Human), Transformed cell line (CVCL_0045), HEK293T — Homo sapiens (Human), Transformed cell line (CVCL_0063), HEK293T/17 — Homo sapiens (Human), Transformed cell line (CVCL_1926), SpCas9 — Homo sapiens (Human), Induced pluripotent stem cell (CVCL_RG56)

## Figures

16 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12273960/full.md

---
Source: https://tomesphere.com/paper/PMC12273960