repgenHMM: a dynamic programming tool to infer the rules of immune receptor generation from sequence data
Yuval Elhanati, Quentin Marcou, Thierry Mora, and Aleksandra M., Walczak

TL;DR
This paper introduces repgenHMM, a dynamic programming tool using Hidden Markov Models to accurately infer the probabilistic rules of immune receptor gene rearrangements from sequence data, enabling better understanding of immune diversity.
Contribution
It presents a novel HMM-based method that efficiently infers the parameters of immune receptor generation, improving over previous computationally intensive approaches.
Findings
Successfully inferred rearrangement parameters from real T cell receptor sequences.
Generated synthetic sequences matching known models, validating the method.
Estimated immune repertoire diversity to be approximately 10^23 for human T cells.
Abstract
The diversity of the immune repertoire is initially generated by random rearrangements of the receptor gene during early T and B cell development. Rearrangement scenarios are composed of random events -- choices of gene templates, base pair deletions and insertions -- described by probability distributions. Not all scenarios are equally likely, and the same receptor sequence may be obtained in several different ways. Quantifying the distribution of these rearrangements is an essential baseline for studying the immune system diversity. Inferring the properties of the distributions from receptor sequences is a computationally hard problem, requiring enumerating every possible scenario for every sampled receptor sequence. We present a Hidden Markov model, which accounts for all plausible scenarios that can generate the receptor sequences. We developed and implemented a method based on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
