Ask2Me VarHarmonizer: A Python-Based Tool to Harmonize Variants from Cancer Genetic Testing Reports and Map them to the ClinVar Database
Yuxi Liu, Kanhua Yin, Basanta Lamichhane, John F. Sandbach, Gayle, Patel, Gia Compagnoni, Richard H. Kanak, Barry Rosen, David P. Ondrula, Linda, Smith, Eric Brown, Linsey Gold, Pat Whitworth, Colleen App, David Euhus, Alan, Semine, S. Dwight Lyons, Melford Allan C. Lazarte

TL;DR
Ask2Me VarHarmonizer is a Python tool that automates the harmonization and mapping of genetic variants from cancer reports to the ClinVar database, improving consistency and accuracy in variant interpretation.
Contribution
This paper introduces Ask2Me VarHarmonizer, a novel Python-based tool that automates variant data cleaning, harmonization, and mapping to ClinVar, addressing inconsistencies across laboratories.
Findings
78% of variants successfully mapped to ClinVar
High mapping consistency of 99% with and without transcript info
Identified 4798 unique variants after harmonization
Abstract
PURPOSE: The popularity of germline genetic panel testing has led to a vast accumulation of variant-level data. Variant names are not always consistent across laboratories and not easily mappable to public variant databases such as ClinVar. A tool that can automate the process of variants harmonization and mapping is needed to help clinicians ensure their variant interpretations are accurate. METHODS: We present a Python-based tool, Ask2Me VarHarmonizer, that incorporates data cleaning, name harmonization, and a four-attempt mapping to ClinVar procedure. We applied this tool to map variants from a pilot dataset collected from 11 clinical practices. Mapping results were evaluated with and without the transcript information. RESULTS: Using Ask2Me VarHarmonizer, 4728 out of 6027 variant entries (78%) were successfully mapped to ClinVar, corresponding to 3699 mappable unique variants. With…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Rare Diseases · Cancer Genomics and Diagnostics · Epigenetics and DNA Methylation
