Prediction of Metabolic Pathways Involvement in Prokaryotic UniProtKB Data by Association Rule Mining
Imane Boudellioua, Rabie Saidi, Maria Martin, Robert Hoehndorf, and, Victor Solovyev

TL;DR
This paper presents a rule mining-based system to predict metabolic pathways in prokaryotic proteins, significantly expanding pathway annotations in UniProtKB with high accuracy.
Contribution
It introduces a novel association rule mining approach for automatic protein pathway annotation, achieving high predictive performance and large-scale application.
Findings
Achieved an F1-measure of 0.982 in pathway prediction
Predicted pathways for over 663,000 UniProtKB entries
Expanded pathway annotations for many previously unannotated proteins
Abstract
The widening gap between known proteins and their functions has encouraged the development of methods to automatically infer annotations. Automatic functional annotation of proteins is expected to meet the conflicting requirements of maximizing annotation coverage, while minimizing erroneous functional assignments. This trade-off imposes a great challenge in designing intelligent systems to tackle the problem of automatic protein annotation. In this work, we present a system that utilizes rule mining techniques to predict metabolic pathways in prokaryotes. The resulting knowledge represents predictive models that assign pathway involvement to UniProtKB entries. We carried out an evaluation study of our system performance using cross-validation technique. We found that it achieved very promising results in pathway identification with an F1-measure of 0.982 and an AUC of 0.987. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
