FIT: Tag based method for fusion proteins identification

Kang Ning; Alexey I. Nesvizhskii

arXiv:1408.4950·q-bio.QM·August 22, 2014

FIT: Tag based method for fusion proteins identification

Kang Ning, Alexey I. Nesvizhskii

PDF

Open Access

TL;DR

The paper introduces FIT, a sequence tag-based algorithm that improves the identification of fusion proteins in mass spectrometry data by combining de novo sequencing and peptide-spectrum matching, achieving high sensitivity and low false positives.

Contribution

The novel FIT algorithm effectively detects fusion proteins in proteomic datasets using a combined approach of sequence tags and peptide-spectrum matching.

Findings

01

High sensitivity in simulated datasets

02

Low false positive rates

03

Effective fusion protein identification

Abstract

There is increased interest in the identification and analysis of gene fusions and chimeric RNA transcripts. While most recent efforts focused on the analysis of genomic and transcriptomic data, identi-fication of novel peptides corresponding to such events in mass spectrometry-based proteomic datasets would provide complemen-tary, protein-level evidence. The process of identifying fusion pro-teins from mass spectrometry data is inherently difficult because such events are rare. It is also complicated due to large amount of spectra collected and the explosion in the number of candidate peptide sequences that need to be considered, which makes ex-haustive search for all possible fusion partner proteins impractical. In this work, we present a sequence tag based fusion protein identi-fication algorithm, FIT, that combines the virtue of both de novo sequence tag retrieval and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Proteomics Techniques and Applications · Genomics and Phylogenetic Studies · Machine Learning in Bioinformatics