Transform-Domain Classification of Human Cells based on DNA Methylation Datasets
Xueyuan Zhao, Dario Pompili

TL;DR
This paper introduces a transform-domain classification method using Walsh-Hadamard Transform on DNA methylation data to efficiently distinguish between normal and cancerous human cells, achieving faster computation with maintained accuracy.
Contribution
The study presents a novel WHT-based pipeline for classifying human cells from DNA methylation data, significantly reducing computation time while preserving accuracy.
Findings
WHT-based method speeds up classification by over ten times.
Maintains comparable accuracy to traditional sequence classification.
Applicable to various human cancer types and datasets.
Abstract
A novel method to classify human cells is presented in this work based on the transform-domain method on DNA methylation data. DNA methylation profile variations are observed in human cells with the progression of disease stages, and the proposal is based on this DNA methylation variation to classify normal and disease cells including cancer cells. The cancer cell types investigated in this work cover hepatocellular (sample size n = 40), colorectal (n = 44), lung (n = 70) and endometrial (n = 87) cancer cells. A new pipeline is proposed integrating the DNA methylation intensity measurements on all the CpG islands by the transformation of Walsh-Hadamard Transform (WHT). The study reveals the three-step properties of the DNA methylation transform-domain data and the step values of association with the cell status. Further assessments have been carried out on the proposed machine learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEpigenetics and DNA Methylation · Gene expression and cancer classification · Machine Learning in Bioinformatics
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
