Identifying OCRs in cfDNA WGS Data by Correlation Clustering
Farshad Noravesh, Fahimeh Palizban

TL;DR
This paper presents a novel correlation clustering method to identify open chromatin regions (OCRs) from cfDNA WGS data, enabling tissue-specific insights with low input requirements and high accuracy.
Contribution
The study introduces a correlation clustering algorithm that predicts OCRs from WGS data using local sequencing depth, reducing the need for multiple experimental modalities.
Findings
Over 67% overlap with validated OCRs from ATAC-db
Effective prediction of OCRs from low-input WGS data
Potential for cost-effective tissue origin analysis in liquid biopsy
Abstract
In the recent decade, the emergence of liquid biopsy has significantly improved cancer monitoring and detection. Dying cells, including those originating from tumors, shed their DNA into the bloodstream and contribute to a pool of circulating fragments called cell-free DNA (cfDNA). Identifying the tissue origin of these DNA fragments from their epigenetic features has implications in various clinical contexts. Open chromatin regions (OCRs) are important epigenetic features of DNA that reflect cell types of origin. Profiling these features by DNase-seq, ATAC-seq, and histone ChIP-seq provides insights into tissue-specific and disease-specific regulatory mechanisms. Integration of genomic and epigenomic features for cancer detection by liquid biopsy has previously been reported. However, many multimodal analyses require large amounts of cfDNA input and/or multiple types of experiments to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCancer Genomics and Diagnostics · Epigenetics and DNA Methylation · Cancer-related molecular mechanisms research
MethodsBottleneck Attention Module
