# Targeted phasing of 2–200 kilobase DNA fragments with a short-read sequencer and a single-tube linked-read library method

**Authors:** Veronika Mikhaylova, Madison Rzepka, Tetsuya Kawamura, Yu Xia, Peter L. Chang, Shiguo Zhou, Amber Paasch, Long Pham, Naisarg Modi, Likun Yao, Adrian Perez-Agustin, Sara Pagans, T. Christian Boles, Ming Lei, Yong Wang, Ivan Garcia-Bassets, Zhoutao Chen

PMC · DOI: 10.1038/s41598-024-58733-0 · Scientific Reports · 2024-04-05

## TL;DR

This paper introduces a new method called TELL-Seq that allows scientists to determine which DNA variants come from each parent using short-read sequencing for DNA fragments up to 200 kilobases long.

## Contribution

The novel contribution is a targeted TELL-Seq protocol that enables accurate phasing of 2–200 kb DNA loci using short-read sequencing.

## Key findings

- TELL-Seq can phase DNA fragments up to 200 kb using short-read sequencing.
- The method was validated on clinically relevant genes like BRCA1 and BRCA2.
- Different enrichment methods were successfully used for varying fragment sizes.

## Abstract

In the human genome, heterozygous sites refer to genomic positions with a different allele or nucleotide variant on the maternal and paternal chromosomes. Resolving these allelic differences by chromosomal copy, also known as phasing, is achievable on a short-read sequencer when using a library preparation method that captures long-range genomic information. TELL-Seq is a library preparation that captures long-range genomic information with the aid of molecular identifiers (barcodes). The same barcode is used to tag the reads derived from the same long DNA fragment within a range of up to 200 kilobases (kb), generating linked-reads. This strategy can be used to phase an entire genome. Here, we introduce a TELL-Seq protocol developed for targeted applications, enabling the phasing of enriched loci of varying sizes, purity levels, and heterozygosity. To validate this protocol, we phased 2–200 kb loci enriched with different methods: CRISPR/Cas9-mediated excision coupled with pulse-field electrophoresis for the longest fragments, CRISPR/Cas9-mediated protection from exonuclease digestion for mid-size fragments, and long PCR for the shortest fragments. All selected loci have known clinical relevance: BRCA1, BRCA2, MLH1, MSH2, MSH6, APC, PMS2, SCN5A-SCN10A, and PKI3CA. Collectively, the analyses show that TELL-Seq can accurately phase 2–200 kb targets using a short-read sequencer.

## Linked entities

- **Genes:** BRCA1 (BRCA1 DNA repair associated) [NCBI Gene 672], BRCA2 (BRCA2 DNA repair associated) [NCBI Gene 675], MLH1 (mutL homolog 1) [NCBI Gene 4292], MSH2 (mutS homolog 2) [NCBI Gene 4436], MSH6 (mutS homolog 6) [NCBI Gene 2956], APC (APC regulator of Wnt signaling pathway) [NCBI Gene 324], PMS2 (PMS1 homolog 2, mismatch repair system component) [NCBI Gene 5395]

## Full-text entities

- **Genes:** SCN10A (sodium voltage-gated channel alpha subunit 10) [NCBI Gene 6336] {aka FEPS2, Nav1.8, PN3, SNS}, PMS2 (PMS1 homolog 2, mismatch repair system component) [NCBI Gene 5395] {aka HNPCC4, LYNCH4, MLH4, MMRCS4, PMS-2, PMSL2}, APC (APC regulator of Wnt signaling pathway) [NCBI Gene 324] {aka BTPS2, DESMD, DP2, DP2.5, DP3, GS}, MSH6 (mutS homolog 6) [NCBI Gene 2956] {aka GTBP, GTMBP, HNPCC5, HSAP, LYNCH5, MMRCS3}, SCN5A (sodium voltage-gated channel alpha subunit 5) [NCBI Gene 6331] {aka CDCD2, CMD1E, CMPD2, HB1, HB2, HBBD}, MSH2 (mutS homolog 2) [NCBI Gene 4436] {aka COCA1, FCC1, HNPCC, HNPCC1, LCFS2, LYNCH1}, BRCA1 (BRCA1 DNA repair associated) [NCBI Gene 672] {aka BRCAI, BRCC1, BROVCA1, FANCS, IRIS, PNCA4}, BRCA2 (BRCA2 DNA repair associated) [NCBI Gene 675] {aka BRCC2, BROVCA2, FACD, FAD, FAD1, FANCD}, MLH1 (mutL homolog 1) [NCBI Gene 4292] {aka COCA2, FCC2, HNPCC, HNPCC2, LYNCH2, MLH-1}
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC10997766/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC10997766/full.md

## References

55 references — full list in the complete paper: https://tomesphere.com/paper/PMC10997766/full.md

---
Source: https://tomesphere.com/paper/PMC10997766