Dynamic Kernel Matching for Non-conforming Data: A Case Study of T-cell Receptor Datasets
Jared Ostmeyer, Scott Christley, Lindsay Cowell

TL;DR
This paper introduces dynamic kernel matching (DKM), a method to adapt statistical classifiers for non-conforming data like T-cell receptor sequences, demonstrating its effectiveness in disease diagnosis datasets.
Contribution
The paper presents DKM, a novel approach to modify classifiers for non-conforming data, with successful application to TCR datasets for disease diagnosis.
Findings
Successfully fitted classifiers with DKM to TCR datasets
Achieved accurate predictions on holdout data
Identified biologically relevant patterns in classifier decisions
Abstract
Most statistical classifiers are designed to find patterns in data where numbers fit into rows and columns, like in a spreadsheet, but many kinds of data do not conform to this structure. To uncover patterns in non-conforming data, we describe an approach for modifying established statistical classifiers to handle non-conforming data, which we call dynamic kernel matching (DKM). As examples of non-conforming data, we consider (i) a dataset of T-cell receptor (TCR) sequences labelled by disease antigen and (ii) a dataset of sequenced TCR repertoires labelled by patient cytomegalovirus (CMV) serostatus, anticipating that both datasets contain signatures for diagnosing disease. We successfully fit statistical classifiers augmented with DKM to both datasets and report the performance on holdout data using standard metrics and metrics allowing for indeterminant diagnoses. Finally, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Inference · Imbalanced Data Classification Techniques
