Latent protein trees
Ricardo Henao, J. Will Thompson, M. Arthur Moseley, Geoffrey S., Ginsburg, Lawrence Carin, Joseph E. Lucas

TL;DR
This paper introduces a hierarchical Bayesian model that captures complex correlation structures in proteomics data, leveraging partial identification and observed correlations to infer latent proteins and their relationships.
Contribution
The paper presents a novel Bayesian approach that models multi-level correlation structures in proteomics data, integrating peptide identification and data correlations.
Findings
Effective in artificial and benchmark data
Successfully applied to influenza blood plasma data
Reveals latent protein correlation structures
Abstract
Unbiased, label-free proteomics is becoming a powerful technique for measuring protein expression in almost any biological sample. The output of these measurements after preprocessing is a collection of features and their associated intensities for each sample. Subsets of features within the data are from the same peptide, subsets of peptides are from the same protein, and subsets of proteins are in the same biological pathways, therefore, there is the potential for very complex and informative correlational structure inherent in these data. Recent attempts to utilize this data often focus on the identification of single features that are associated with a particular phenotype that is relevant to the experiment. However, to date, there have been no published approaches that directly model what we know to be multiple different levels of correlation structure. Here we present a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
