Copy Number Variants and Segmental Duplications Show Different Formation Signatures
Philip M. Kim, Jan O. Korbel, Xueying Chen, Mark B. Gerstein

TL;DR
This study investigates the formation mechanisms of Copy Number Variants and Segmental Duplications, revealing distinct patterns and associations with genomic elements, and suggesting different underlying processes for their formation.
Contribution
It provides a comparative analysis of CNVs and SDs, highlighting their different formation signatures and associations with genomic repeats, which advances understanding of their evolutionary mechanisms.
Findings
SDs are significantly co-localized, following a power-law distribution.
Older SDs have a stronger association with Alu elements than younger SDs.
CNVs show no association with Alu elements, unlike SDs.
Abstract
In addition to variation in terms of single nucleotide polymorphisms (SNPs), whole regions ranging from several kilobases up to a megabase in length differ in copy number among individuals. These differences are referred to as Copy Number Variants (CNVs) and extensive mapping of these is underway. Recent studies have highlighted their great prevalence in the human genome. Segmental Duplications (SDs) are long (>1kb) stretches of duplicated DNA with high sequence identity. First, we analyzed the co-localization of SDs and find that SDs are significantly co-localized with each other, resulting in a power-law distribution, which suggests a preferential attachment mechanism, i.e. existing SDs are likely to be involved in creating new ones nearby. Second, we look at the relationship of CNVs/SDs with various types of repeats. We we find that the previously recognized association of SDs with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomic variations and chromosomal abnormalities · Fractal and DNA sequence analysis · Algorithms and Data Compression
