Inference of B cell clonal families using heavy/light chain pairing information
Duncan K. Ralph, Frederick A. Matsen IV

TL;DR
This paper introduces a new method for inferring B cell clonal families by leveraging heavy/light chain pairing information from high-throughput sequencing, improving accuracy especially for challenging light chain data.
Contribution
It presents a novel approach to incorporate pairing information into clonal family inference, along with methods to correct imperfect pairing data and enhancements to the partis software.
Findings
Improved clustering accuracy using pairing information.
Effective correction methods for imperfect pairing data.
Enhanced software with new features for BCR analysis.
Abstract
Next generation sequencing of B cell receptor (BCR) repertoires has become a ubiquitous tool for understanding the antibody-mediated immune response: it is now common to have large volumes of sequence data coding for both the heavy and light chain subunits of the BCR. However, until the recent development of high throughput methods of preserving heavy/light chain pairing information, these samples contained no explicit information on which heavy chain sequence pairs with which light chain sequence. One of the first steps in analyzing such BCR repertoire samples is grouping sequences into clonally related families, where each stems from a single rearrangement event. Many methods of accomplishing this have been developed, however, none so far has taken full advantage of the newly-available pairing information. This information can dramatically improve clustering performance, especially…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMonoclonal and Polyclonal Antibodies Research · T-cell and B-cell Immunology · Glycosylation and Glycoproteins Research
