An algorithm to align a chain of sequences to paths in a pangenome graph
Gorkem Kadir Solun, Ugur Dogrusoz

TL;DR
This paper presents an algorithm for aligning a sequence chain to paths in a pangenome graph, aiding functional analysis by identifying maximal paths that match the sequence chain with optional gaps.
Contribution
The novel algorithm efficiently finds all maximal paths in a pangenome graph aligning with a given sequence chain, enhancing pangenome graph analysis capabilities.
Findings
Algorithm successfully identifies all maximal paths matching sequence chains.
Supports user-defined maximal gaps in path alignment.
Facilitates functional analysis of complex pangenome graphs.
Abstract
Affordable, high-quality whole-genome assemblies have made it possible to construct rich pangenomes that capture haplotype diversity across many species. As these datasets grow, they motivate the development of specialized techniques capable of handling the dense sequence variation found in large groups of related genomes. A common strategy is to encode pangenomic information in graph form, which provides a flexible substrate for improving algorithms in areas such as alignment, visualization, and functional analysis. Methods built on these graph models have already shown clear advantages in core bioinformatics workflows, including read mapping, variant discovery, and genotyping. By integrating multiple sequence and coordinate representations into a single structure, pangenome graphs offer a unified and expressive framework for comparative genomics. Although it remains unclear whether…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Genetic Associations and Epidemiology · Genome Rearrangement Algorithms
