MEDNA-DFM: A Dual-View FiLM-MoE Model for Explainable DNA Methylation Prediction
Yi He (1, 4), Yina Cao (2), Jixiu Zhai (3, 4), Di Wang (1, 4), Junxiao Kong (4), Tianchi Lu (4, 5) ((1) Cuiying Honors College, Lanzhou University, Lanzhou, Gansu, China, (2) School of Management, Lanzhou University, Lanzhou, Gansu, China, (3) Shanghai Innovation Institute

TL;DR
This paper introduces MEDNA-DFM, an explainable deep learning model for DNA methylation prediction that captures conserved motifs and generates biological hypotheses, outperforming prior methods in reliability and interpretability.
Contribution
The study presents MEDNA-DFM, a novel dual-view FiLM-MoE model with mechanism-inspired algorithms, enhancing interpretability and motif extraction in DNA methylation prediction.
Findings
MEDNA-DFM accurately distinguishes methylation patterns across species.
The model's generalization relies on conserved motifs like GC content.
Motif extraction significantly outperforms previous methods.
Abstract
Accurate computational identification of DNA methylation is essential for understanding epigenetic regulation. Although deep learning excels in this binary classification task, its "black-box" nature impedes biological insight. We address this by introducing a high-performance model MEDNA-DFM, alongside mechanism-inspired signal purification algorithms. Our investigation demonstrates that MEDNA-DFM effectively captures conserved methylation patterns, achieving robust distinction across diverse species. Validation on external independent datasets confirms that the model's generalization is driven by conserved intrinsic motifs (e.g., GC content) rather than phylogenetic proximity. Furthermore, applying our developed algorithms extracted motifs with significantly higher reliability than prior studies. Finally, empirical evidence from a Drosophila 6mA case study prompted us to propose a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗hy-0003/MEDNA-DFM_5hmC_H.sapiensmodel· 7 dl7 dl
- 🤗hy-0003/MEDNA-DFM_5hmC_M.musculusmodel· 1 dl1 dl
- 🤗hy-0003/MEDNA-DFM_4mC_C.equisetifoliamodel· 2 dl2 dl
- 🤗hy-0003/MEDNA-DFM_4mC_F.vescamodel· 1 dl1 dl
- 🤗hy-0003/4mC_S.cerevisiaemodel
- 🤗hy-0003/MEDNA-DFM_4mC_Tolypocladiummodel· 2 dl2 dl
- 🤗hy-0003/MEDNA-DFM_6mA_A.thalianamodel· 1 dl1 dl
- 🤗hy-0003/MEDNA-DFM_6mA_C.elegansmodel· 1 dl1 dl
- 🤗hy-0003/MEDNA-DFM_6mA_C.equisetifoliamodel
- 🤗hy-0003/MEDNA-DFM_6mA_D.melanogastermodel· 2 dl2 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEpigenetics and DNA Methylation · Machine Learning in Bioinformatics · Genomics and Chromatin Dynamics
