# FSFT6mA: a feature-synthesis fine-tuning framework for DNA 6mA site prediction

**Authors:** Hong-Jin Yu, Ying Zhang, Dong-Jun Yu, Guansheng Zheng

PMC · DOI: 10.3389/fgene.2025.1750223 · Frontiers in Genetics · 2026-01-12

## TL;DR

This paper introduces FSFT6mA, a deep learning framework that improves DNA 6mA site prediction by generating synthetic features using GANs.

## Contribution

The novel use of GANs for feature synthesis in DNA 6mA site prediction significantly enhances model performance.

## Key findings

- FSFT6mA improves MCC by 2.6% on A. thaliana and 1.9% on D. melanogaster.
- The model achieves AUC values of 0.969 and 0.968 on A. thaliana and D. melanogaster, respectively.
- GAN-generated features lead to superior performance compared to existing predictors.

## Abstract

DNA N6-methyladenine (6mA) is an important epigenetic modification that plays a critical role in gene expression regulation and has been associated with diverse biological processes and diseases. Accurate identification of 6mA sites is essential for understanding its functional significance. Although an increasing number of computational approaches have been proposed, they almost exclusively rely on sequence-derived features. The potential of novel feature representations to further enhance predictive performance remains an important research problem.

In this study, we propose FSFT6mA, a novel deep learning-based framework designed to improve 6mA site prediction through feature synthesis. The model is initially trained on the original datasets using a deep convolutional neural network. Subsequently, a Generative Adversarial Network (GAN) is employed to generate synthetic features from intermediate network layers, which are then used to fine-tune the well-trained model in the first stage.

Incorporating GAN-generated features leads to notable performance gains, improving MCC by 2.6% on A. thaliana and 1.9% on D. melanogaster compared with the base models without synthetic features. Independent validation experiments demonstrate that FSFT6mA achieves superior performance compared to existing state-of-the-art predictors, attaining AUC values of 0.969 and 0.968 on A. thaliana and D. melanogaster, respectively.

These results indicate that FSFT6mA is an accurate tool for DNA 6mA site prediction. The data and the codes used in this study are freely accessible on GitHub (https://github.com/YuHong-Jin/FSFT6mA).

## Full-text entities

- **Chemicals:** N6-methyladenine (MESH:C005955), 6mA (-)
- **Species:** Drosophila melanogaster (fruit fly, species) [taxon 7227], Arabidopsis thaliana (mouse-ear cress, species) [taxon 3702]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12832111/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12832111/full.md

## References

44 references — full list in the complete paper: https://tomesphere.com/paper/PMC12832111/full.md

---
Source: https://tomesphere.com/paper/PMC12832111