PIntron: a Fast Method for Gene Structure Prediction via Maximal Pairings of a Pattern and a Text
Paola Bonizzoni, Gianluca Della Vedova, Yuri Pirola and, Raffaella Rizzi

TL;DR
PIntron is a fast and accurate gene structure prediction method that efficiently processes large transcript clusters by leveraging maximal pairings and redundancy in transcript data.
Contribution
The paper introduces a novel algorithm combining maximal pairings and redundancy exploitation for rapid, accurate gene structure prediction from large transcript datasets.
Findings
Processes large EST clusters in seconds
Achieves high sensitivity and specificity
Outperforms existing tools in speed and accuracy
Abstract
Current computational methods for exon-intron structure prediction from a cluster of transcript (EST, mRNA) data do not exhibit the time and space efficiency necessary to process large clusters of over than 20,000 ESTs and genes longer than 1Mb. Guaranteeing both accuracy and efficiency seems to be a computational goal quite far to be achieved, since accuracy is strictly related to exploiting the inherent redundancy of information present in a large cluster. We propose a fast method for the problem that combines two ideas: a novel algorithm of proved small time complexity for computing spliced alignments of a transcript against a genome, and an efficient algorithm that exploits the inherent redundancy of information in a cluster of transcripts to select, among all possible factorizations of EST sequences, those allowing to infer splice site junctions that are highly confirmed by the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · RNA and protein synthesis mechanisms · Genomics and Chromatin Dynamics
