Algorithms for Reconstructing B Cell Lineages in the Presence of Context-Dependent Somatic Hypermutation
Yongkang Li, Kevin J. Wiehe, Scott C. Schmidler

TL;DR
This paper presents a novel method for reconstructing B cell lineages that accounts for context-dependent hypermutation, improving accuracy over traditional models, and demonstrates its effectiveness on both simulated and real HIV antibody data.
Contribution
The paper introduces a new approach combining data-augmentation and importance sampling to incorporate site-dependence in phylogenetic inference, compatible with existing software.
Findings
Accounting for context-dependence improves lineage and ancestral sequence reconstruction.
Incorporating priors based on VDJ recombination enhances germline region accuracy.
Modified priors reduce errors in non-templated nucleotide regions.
Abstract
We introduce a method for approximating posterior probabilities of phylogenetic trees and reconstructing ancestral sequences under models of sequence evolution with site-dependence, where standard phylogenetic likelihood computations (pruning) fail. Our approach uses a combined data-augmentation and importance sampling scheme. A key advantage of our approach is the ability to leverage existing highly optimized phylogenetic software. We apply our approach to the reconstruction of B cell receptor affinity maturation lineages from high-throughput repertoire sequencing data and evaluate the impact of incorporating site-dependence on the reconstruction accuracy of both trees and ancestral sequences. We show that accounting for context-dependence during inference always improves the estimates of both ancestral sequences and lineage trees on simulated datasets. We also examine the impact of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · HIV Research and Treatment · vaccines and immunoinformatics approaches
