Genomic Balancing Act: Deciphering DNA rearrangements in the Complex Chromosomal Aberration involving 5p15.2, 2q31.1 and 18q21.32
James Lupski, Zain Dardas, Dana Marafi, Ruizhi Duan, Jawid Fatih, Omnia El-Rashidy, Christopher Grochowski, Claudia Carvalho, Shalini Jhangiani, Weimin Bi, Haowei Du, Richard Gibbs, Jennifer Posey, Daniel Calame, Maha Zaki

TL;DR
This study identifies a complex chromosomal rearrangement causing a neurodevelopmental disorder by combining genomic technologies to analyze a family's genetic inheritance.
Contribution
The study reveals a novel chromoplexy mechanism involving multiple chromosomal breakpoints and inheritance patterns in a neurodevelopmental disorder.
Findings
A complex chromosomal rearrangement involving 5p15.2, 2q31.1, and 18q21.32 was identified in a patient with a neurodevelopmental disorder.
Breakpoint junction analyses revealed four distinct junctions in the father, who carries a balanced chromosomal rearrangement.
The rearrangement likely arose via chromoplexy, involving DNA repair mechanisms like NHEJ or alt-EJ.
Abstract
Despite extensive research into the genetic underpinnings of neurodevelopmental disorders (NDD), many clinical cases remain unresolved. We studied a female proband with a NDD, mildly dysmorphic facial features, and brain stem hypoplasia on neuroimaging. Comprehensive genomic analyses revealed a terminal 5p loss and terminal 18q gain in the proband while a diploid copy number for chromosomes 5 and 18 in both parents. Genomic investigations in the proband identified an unbalanced translocation t(5;18) with additional genetic material from chromosome 2 (2q31.3) inserted at the breakpoint, pointing to a complex chromosomal rearrangement (CCR) involving 5p15.2, 2q31.3, and 18q21.32. Breakpoint junction analyses enabled by long read genome sequencing unveiled the presence of four distinct junctions in the father, who is carrier of a balanced CCR. The proband inherited from the father both the…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomic variations and chromosomal abnormalities · Genomics and Chromatin Dynamics · Chromosomal and Genetic Variations
Introduction
Complex chromosomal rearrangements (CCRs) are intriguing genetic phenomena that challenge our understanding of the human genome’s structural integrity. These genomic DNA rearrangements involve intricate and often unexpected alterations in the organization of chromosomal material. CCRs represent a subclass of structural variations, distinct from simple translocations or unique locus simple DNA rearrangements (1).
In CCRs, multiple double-strand breaks (DSBs) are hypothesized to occur in different regions of the genome on 3 or more chromosomes, setting the stage for a complex and intricate genetic jigsaw puzzle (2, 3). These genomic DNA breaks can be caused by various factors, including DNA damage or loss of genome integrity during DNA replicative repair mechanisms (2). What ensues is a genomic reshuffling, characterized by the interplay of chromosome translocations, deletions, insertions, and inversions. This intricate choreography of genetic material challenges our conventional understanding of linear chromosomal sequences.
Understanding mechanisms for CCR derivation hold profound clinical significance, as they can be associated with a range of genetic disorders, including neurodevelopmental disorders, and have implications for inheritance and for both family counseling and prognosis. Therefore, investigating CCRs is vital for both diagnosing and comprehending the genetic basis of these conditions. However, despite their clinical significance for proband and family, identifying CCRs remains a formidable challenge in the fields of genetics and clinical cytogenetics. The intricate nature of these genome rearrangements, with multiple breakpoint junctions and various types of structural alterations, can challenge interpretations in the clinical diagnostic laboratory.
This study investigates a family with a female child proband who presented with a neurodevelopmental disorder, accompanied by subtle facial anomalies. Despite extensive prior research investigations, her condition and the genetic model for the family remained an enigma, prompting a multifaceted technological exploration of her genomic landscape. The investigation initiated with a comprehensive examination of her genome, employing a multi-tier approach that included several genomic technologies, pointing to a CCR involving not one but three distinct chromosome regions: 5p15.2, 2q31.3, and 18q21.32. We provide evidence implicating an underlying mechanism for the CCR, its clinical implications for the family, and the impact of gene dosage imbalance on the proband.
Methods
Patient Enrollment
This study adhered to the principles of the Declaration of Helsinki. The family trio (proband, mother, and father) were enrolled under a protocol approved by the institutional review board (IRB) at Baylor College of Medicine (BCM) (H-29697). This family was identified through the Baylor-Hopkins Center for Mendelian Genomics (BHCMG) and BCM Genomics Research Elucidates the Genetics of Rare Disease (BCM-GREGoR) database and as part of the analysis of a large Middle Eastern and North African (MENA) cohort with rare neurodevelopmental disorders.
Exome sequencing
Exome sequencing (ES) was performed at the Baylor College of Medicine Human Genome Sequencing Center (BCM-HGSC) with an Illumina dual indexed, paired-end pre-capture library per the manufacturer’s protocol with previously described modifications (4). Paired-end sequencing was performed with the Illumina NovaSeq6000 platform. Samples achieved 98% of the targeted exome bases covered to a depth of 20× or greater and had a sequencing yield of 13.2 Gb. Illumina sequence analysis was performed with the HGSC HgV analysis pipeline, which moves data through various analysis tools from the initial sequence generation on the instrument to annotated variant calls (SNPs and intra-read in/dels) (5, 6). CNV analyses were implemented using read count from short read (SR) ES data that were normalized for log2 ratio calculation by the best reference approach using our in-house algorithm HMZdupFinder (7) and using XHMM (eXome-Hidden Markov Model) (8).
High-density array comparative genomic hybridization (aCGH)
High-resolution aCGH, using a 1 million probe whole-genome oligonucleotide microarray (Agilent microarray design ID:085903), was performed on all family members. All array-based experiments were implemented according to the Agilent aCGH protocol for probe labeling and hybridization with minor modifications (9).
Nanopore long-read sequencing
Long read (LR) sequencing libraries were generated using the Oxford Nanopore technology (ONT) ligation sequencing kit and then sequenced with the PromethION Beta platform. Sequencing depth was calculated on the resulting alignments using mosdepth v0.2.38 with the parameters ‘-F 3588 -Q 1’, which calculated coverage of depth in 100 bp bins and included only primary and supplemental alignments. Alignment of long read DNA sequencing was performed with NGMLR v0.2.79 using default parameters along with the ‘-bam -fix’ parameter for long CIGAR string support.
Breakpoint junction amplification and sequencing analysis
Soft-clipped reads overlapping breakpoint junctions were extracted from LR sequence alignment files and remapped to the human genome (GRCh38) with the UCSC BLAT tool to single base-pair resolution. Primers were designed upstream and downstream of the identified junction and PCR amplification was performed using the HotStarTaq (Qiagen) polymerase with standard conditions. The amplified DNA rearrangement breakpoint junctions were confirmed by Sanger dideoxynucleotide sequencing.
Results
Case Presentation
The female proband is the first child born to non-consanguineous healthy parents (29-year-old mother and 30-year-old father) at 39 weeks gestational age, by cesarean section (Fig. 1a–b). Her birth weight was 2.8 kg (-1.2SD), length was 47 cm (-1.1SD), and occipital frontal circumference (OFC) was 31.5 cm (-2.2SD). She was admitted to the neonatal intensive care unit (NICU) for three days due to exaggerated physiological jaundice.
At 9 months of age, this infant presented a clinical picture marked by a constellation of distinctive features. She exhibited global developmental delay, characterized by an inability to support her head and a lack of crawling abilities, as well as an inability to recognize her mother. On physical examination, anthropometric measurements showed a weight of 7.2kg (-1.4SD), a length of 66cm (-1.3SD), and a head circumference of 38.5 cm (-4.4SD). Her facial features were distinctive but nonspecific for any recognizable pattern of human malformation: brachycephaly, a high forehead, arched eyebrows, epicanthic folds, wide palpebral fissures, upturned nose with a flat nasal tip, hypoplastic mandible and low-set large ears with elevated ear lobules. A unilateral simian crease was noted, and neurological assessment showed axial hypotonia, mild limb hypertonia, and brisk reflexes. MRI showed a strikingly thin brain stem especially the pons, relatively small cerebellum in addition to increased extra-axial CSF, mild supratentorial ventricular dilatation, and underdeveloped hippocampus (Fig. 1b). Additionally, the proband’s medical history is notable for congenital heart disease (CHD), specifically a patent ductus arteriosus (PDA) and patent foramen ovale (PFO), for which she was admitted at 2 months of age for a 20-day hospitalization. Cardiovascular evaluation by echocardiogram showed a PDA measuring 1.2mm and a PFO also measuring 1.2mm. Electroencephalogram (EEG) results remain within the normal range, reflecting typical brain electrical activity.
Initial G-banding analysis revealed a normal karyotype of 46,XX. Subsequent trio Exome analysis did not identify evidence for disease-associated single nucleotide variants (SNVs) or insertions/deletions (indels).
Copy number variation (CNV) analysis and validation
Subsequent analysis of the extant exome data, utilizing the HMZDupFinder (7) and XHMM tools (8) for the detection of copy number variations (CNVs), unveiled the presence of multiple seemingly de novo CNV deletions on chromosome 5p and duplications on chromosome 18q (Fig. 1c). To further interrogate these findings, a high-resolution aCGH was employed using a 1 million probe whole-genome oligonucleotide microarray (Agilent microarray design ID: 085903). Intriguingly, this advanced genome analysis demonstrated a diploid copy number status for chromosomes 5 and 18 in both parents, while the proband exhibited a terminal loss of 5p15.33–5p15.2 and a terminal gain on 18q21.32–18q23 (Fig. 1d). These molecular findings suggested the possibility of an unbalanced translocation t(5;18), a hypothesis corroborated by the subsequent repetition of karyotyping, which confirmed the presence of additional genetic material on 5p (Fig. 1e).
Evidence for Chromosome 2 interchromosomal insertion
We performed trio long read ONT sequencing to unravel the architecture and the orientation of the translocated genomic regions. Surprisingly, breakpoint mapping revealed that the apparent translocation (5;18) also included chromosome 2 genomic material (1 Mb mapping to 2q31.3) and provided evidence for a complex chromosomal rearrangement (CCR) between 5p15.2, 2q31.3, and 18q21.32 (Fig. 2). Since the 2q31.3 material was translocated in a balanced state in the proband, ES read depth data showed no abnormalities in the proband’s chr2 ES data (Fig.S1). ONT data also showed that the CCR was inherited from the father who has the CCR in a balanced state.
Genomic Rearrangement Architecture and Breakpoint Junction Sequences
ONT data revealed the presence of three distinct breakpoint junction sequences in both the proband and her father. Junction 1 connects the terminal portion of the deleted segment on chromosome 5p (Fig. 2a) with the proximal site of the inserted 2q31.3 genomic region; notably, 2q31.3 is connected in an inverted orientation (Fig. 2b). This junction presents a blunt end fusion between 5p and 2q31.3. The distal side of the 2q31.3 (chr2:181767345, hg38) is then connected to the beginning of the duplicated region in 18q in a direct orientation. Additionally, the junction reveals the insertion of a 14 bp segment (junction 2).
To investigate the third junction on chromosome 2, where the 2q31.3 segment was excised (Fig. 2c), we designed inward-facing primers. This junction exhibited a 1 bp microhomology. Since the father carries a balanced rearrangement, a fourth junction was exclusively detected in his ONT data. This junction connects the segment of 5p, which is deleted in the proband, with the 18q21.32 in an inverted orientation (Fig. 2d).
All breakpoints identified through ONT sequencing underwent validation via PCR amplification and subsequent Sanger sequencing. Gel electrophoresis of the PCR products (Fig. 2e) corroborated the inheritance of the CCR from the father, who maintains the translocation in a balanced state. This is evident from the presence of the fourth junction. Conversely, no bands were observed in the mother for any of the junctions, confirming the absence of the CCR in her genetic profile (Fig. 3).
Chromoplexy complex genomic rearrangement
The CCR potentially occurred as a result of different double strand breaks (DSBs) that could be repaired by non-homologous end joining (NHEJ) or alternative end joining (alt-EJ) and be arranged into various derivative configurations: der(2), der(5), der(18). The full structure and the alignment of all the breakpoint junction sequences suggest chromoplexy as an underlying mechanism. Since the father carries the CCR in a balanced manner, possible chromosomal complement in the father’s gametes is 46,XY,seq[GRCh38] der(2)ins(5;2)(p15.2;q31.3q31.3),der(5)ins(5;2)t(5;18)(p15.2;q21.32),der(18)t(5;18)
NC_000002.12:g.180431443_181767356del NC_000005.10:g.pter_14511074delins[NC_0000018.10:g.58855275_qterinv; GGTTGGATAAGGTG;NC_000002.12:g.180431448_181767345inv]
NC_0000018.10:g.58855322_qterdelins[NC_000005.10:g.pter_14511072inv].
At meiosis I, translocated chromosomes and their normal homologues synapse to form a quadrivalent. Therefore, unbalanced gametes are produced by adjacent segregation from a quadrivalent. Fertilization with a normal gamete produce the proband’s genome with unbalanced CCR: 46,XX,seq[GRCh38] der(2)ins(5;2)(p15.2;q31.3q31.3),der(5)ins(5;2)t(5;18)(p15.2;q21.32)
NC_000002.12:g.180431443_181767356del
NC_000005.10:g.pter_14511074delins[NC_0000018.10:g.58855275_qterinv;N[14];NC_000002.12:g.180431448_181767345inv].
Discussion
Despite significant advancements in the fields of genetics and genomics, and extensive research into the genetics of neurodevelopmental disorders, many cases and families remain unsolved. The proband in this study exhibited a constellation of clinical features, including global developmental delay, craniofacial anomalies, and CHD. These clinical findings raised suspicions of an underlying genetic cause. While initial karyotyping and exome analysis did not reveal any single gene mutations, subsequent investigations showed a CCR involving chromosomes 2, 5, and 18, further highlighting the importance of considering structural chromosomal abnormalities in cases of unexplained phenotypes.
The mechanism(s) underlying CCR formation remains elusive; some studies propose models based upon the principle of parsimony and the minimum number of breaks required for the formation of the CCR (10,11). Since this CCR involves three chromosomes and the presence of blunt fusions, direct orientations, and inverted orientations in the junction sequences indicates that the rearrangement is highly complex, consistent with chromoplexy (Fig. 4a). It is a complex genomic rearrangement mechanism that involves multiple double-strand breaks (DSBs) occurring in different regions of the genome (12). It often leads to a series of intricate chromosomal rearrangements and is characterized by the interplay of translocations, deletions, insertions, and inversions. In the context of the CCR described the presence of multiple derivative chromosomes: namely der(2), der(5), der(18), suggests that several DSBs occurred simultaneously, or in close succession, resulting in the intricate rearrangement observed. The CCR’s involvement of chromosomes 2, 5, and 18 implies that these chromosomes were connected during the rearrangement process. Chromoplexy often forms multiple connections between different chromosomes, leading to the exchange of genetic material. This complexity arises from the diverse ways in which the DSBs are repaired, and the genomic regions are rearranged.
Our breakpoint data support the contention that NHEJ and/or alt-EJ mechanisms were involved in the repair of the DSBs.These mechanisms involve the direct joining of broken ends of DNA, often without the need for homologous sequences, and can lead to insertions, deletions, or inversions (13). The father’s balanced rearrangement implies that NHEJ or alt-EJ was precise enough to create a stable, balanced configuration in his genome. However, the rearrangement became unbalanced when transmitted to the proband (Fig.4b). This CCR complies with most aspects of chromoplexy, including the involvement of >2 chromosomes, the absence of sequence homology at their fusion breakpoints (14). However, the fact that the breakpoints are not within transcriptionally active areas, which is usually the case in chromoplexy (14,15), stands against our hypothesis of chromoplexy-type event. Chromoplexy, chromothripsis, and chromoanasynthesis are designated terms that describe the phenomenology of the observed genome changes. The term chromoanagenesis, i.e., chromosome rebirth, encompasses the phenomena of extensive rearrangement occurring in a single cell burst (14–16). Chromothripsis could also be suggested as underlying mechanism of the CCR, given the lack of sequence homology at the breakpoints or only microhomology of a few nucleotides usually observed in chromothripsis similar to chromoplexy (15). Chromothripsis and chromoplexy were first characterized in cancer genomes, however, they also have been shown to underlie Mendelian diseases and genomic disorders (17,18).
Since the proband’s genome carries this CCR in an unbalanced manner, this suggests an altered gene dosage for specific genes and regions of 5p loss and 18q gain involved in the CCR; the observation of a normal 2q31.3 dosage and absence of one junction fragment present in the father is consistent with the segregation of the paternal homologue with the deleted 2q31.3. This imbalance in the proband leads to changes in the expression of the genes mapping within the chr5p and chr18q dosage altered regions and can significantly impact the individual’s phenotype. The father, on the other hand, carries the same CCR but in a balanced state, which explains the absence of the clinical phenotype that is observed in the child. This contrast highlights the importance of maintaining the proper gene dosage for normal development and function (19).
Several dosage-sensitive genes have been identified in the chr5p region that may contribute to the proband’s phenotype. For example, five genes (TERT, SEMA5A, MARCH6, CTNND2, NPR3) are classified as dosage sensitive leading to haploinsufficiency (20). Perhaps TERT haploinsufficiency could contribute to perturbations of telomere biology underlying some CCR. It is firmly established that the chr5p deletion is responsible for Cri du Chat syndrome (MIM#123450), characterized by a distinctive cat-like cry (21), while the duplication of 18q leads to Edwards syndrome (22,23). These are rare chromosomal syndromes, each associated with distinct physical and mental impairments. However, our case presents a unique clinical scenario, showcasing a blended phenotype that may exhibit some overlap with both Cri du chat and Edwards syndrome. This includes distinctive facies, involving a high forehead, arched eyebrows, epicanthic folds, wide palpebral fissures, and a hypoplastic mandible. These features are consistent with those seen in Cri du Chat syndrome and share some similarities with the facial features seen in Edwards syndrome (24), but the blending or mixture precludes diagnosis of either; particularly in the context of an initial ‘normal karyotype’. Global developmental delay, axial hypotonia, and other neurological findings are also observed in both Cri du Chat and Edwards syndrome.
Moreover, the child has a history of CHD, specifically a PDA and PFO. While CHD is not a hallmark feature of Cri du Chat syndrome, it can be present in some cases (24).
Taken together, the case stands as an exemplar of the intricate nature of chromosomal rearrangements and complexity of genome mutation, underscoring the importance of employing a diverse array of genomic technologies to untangle the consequences for both patient and family.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Zhang F, Carvalho CMB, Lupski JR. Complex human chromosomal and genomic rearrangements. Trends Genet [Internet]. 2009 Jul [cited 2023 Nov 2];25(7):298. Available from: /pmc/articles/PMC 4464790/19560228 10.1016/j.tig.2009.05.005PMC 4464790 · doi ↗ · pubmed ↗
- 2Cannan WJ, Pederson DS. Mechanisms and Consequences of Double-strand DNA Break Formation in Chromatin. J Cell Physiol [Internet]. 2016 Jan 1 [cited 2023 Nov 2];231(1):3. Available from: /pmc/articles/PMC 4994891/26040249 10.1002/jcp.25048 PMC 4994891 · doi ↗ · pubmed ↗
- 3Pellestor F, Anahory T, Lefort G, Puechberty J, Liehr T, Hedon B, Complex chromosomal rearrangements: origin and meiotic behavior. Hum Reprod Update [Internet]. 2011 Jul [cited 2023 Nov 6];17(4):476–94. Available from: https://pubmed.ncbi.nlm.nih.gov/21486858/21486858 10.1093/humupd/dmr 010 · doi ↗ · pubmed ↗
- 4Eldomery MK, Coban-Akdemir Z, Harel T, Rosenfeld JA, Gambin T, Stray-Pedersen A, Lessons learned from additional research analyses of unsolved clinical exome cases. Genome Medicine 2017 9:1 [Internet]. 2017 Mar 21 [cited 2023 Oct 26];9(1):1–15. Available from: https://genomemedicine.biomedcentral.com/articles/10.1186/s 13073-017-0412-610.1186/s 13073-017-0412-6PMC 536181328327206 · doi ↗ · pubmed ↗
- 5Challis D, Yu J, Evani US, Jackson AR, Paithankar S, Coarfa C, An integrative variant analysis suite for whole exome next-generation sequencing data. BMC Bioinformatics [Internet]. 2012 Jan 12 [cited 2023 Oct 26];13(1):1–12. Available from: https://bmcbioinformatics.biomedcentral.com/articles/10.1186/1471-2105-13-822239737 10.1186/1471-2105-13-8PMC 3292476 · doi ↗ · pubmed ↗
- 6Reid JG, Carroll A, Veeraraghavan N, Dahdouli M, Sundquist A, English A, Launching genomics into the cloud: Deployment of Mercury, a next generation sequence analysis pipeline. BMC Bioinformatics [Internet]. 2014 Jan 29 [cited 2023 Oct 26];15(1):1–11. Available from: https://bmcbioinformatics.biomedcentral.com/articles/10.1186/1471-2105-15-3024475911 10.1186/1471-2105-15-30PMC 3922167 · doi ↗ · pubmed ↗
- 7Du H, Dardas Z, Jolly A, Grochowski CM, Jhangiani SN, Li H, HMZ Dup Finder: a robust computational approach for detecting intragenic homozygous duplications from exome sequencing data. Nucleic Acids Res [Internet]. 2023;gkad 1223. Available from: 10.1093/nar/gkad 1223 PMC 1089979438153174 · doi ↗ · pubmed ↗
- 8Fromer M, Moran JL, Chambert K, Banks E, Bergen SE, Ruderfer DM, Discovery and statistical genotyping of copy-number variation from whole-exome sequencing depth. Am J Hum Genet [Internet]. 2012 Oct 5 [cited 2024 Jan 14];91(4):597–607. Available from: https://pubmed.ncbi.nlm.nih.gov/23040492/23040492 10.1016/j.ajhg.2012.08.005PMC 3484655 · doi ↗ · pubmed ↗
