# Interrelational Proteomic Sequence Features Enhance Predictive Modeling: Application to COVID-19 Severity

**Authors:** Radwa El-Awadi, Oscar D. Gomez, Daniel Castillo-Secilla, Carolina Torres, Luis J. Herrera, Ignacio Rojas, Francisco M. Ortuño

PMC · DOI: 10.3390/biomedicines14020378 · Biomedicines · 2026-02-06

## TL;DR

This paper introduces INPROF, a web tool that enhances predictive modeling by analyzing protein sequence relationships, with applications in predicting the severity of diseases like COVID-19.

## Contribution

The novel INPROF tool automates the computation of 46 proteomic features from protein sequences to improve predictive modeling.

## Key findings

- INPROF accurately classified the severity of COVID-19 using RNA-Seq data from patients.
- The tool outperformed traditional classification methods based on gene expression data.
- INPROF computes features across multiple proteomic levels, including sequences, structures, and domains.

## Abstract

Background: Comparing biological properties among related proteins has traditionally benefited research in areas such as biomedicine, phylogeny and evolution. Moreover, these kinds of properties have significantly increased as a result of the development of open-access resources and databases. In this context, the multiple sequence alignment (MSA) methods continue to be extensively applied to compare protein sequences and to identify evolutionarily conserved regions. Methods: In this work, we present INPROF, a novel web server designed to centralize and automate the computation of a large collection of features derived from protein sequences. This tool allows us to deeply analyze protein relationships and their conserved information by comparing them through their MSA. Specifically, this platform computes up to 46 different metrics including information at several proteomic levels (categories) like sequences, structures, domains or ontological terms. INPROF was designed to enable bioinformaticians and researchers to create a complete catalogue of features for subsequent prediction and artificial intelligence modeling based on proteins. The INPROF web server and source code are freely available. Results: INPROF were validated by predicting disease’s severity in several RNA-Seq datasets from COVID-19 patients. Specifically, INPROF were extracted from canonical proteins related to differentially expressed genes. Classification performance proved that INPROF were able to accurately classify COVID-19 severity, even outperforming classical classification with expression data. Conclusions: INPROF web server is a flexible platform designed to compute 46 quantitative metrics describing protein interactions which provide biologically meaningful characteristics applicable to downstream classification and prediction algorithms.

## Linked entities

- **Diseases:** COVID-19 (MONDO:0100096)

## Full-text entities

- **Diseases:** autoimmune disorders (MESH:D001327), kidney injury (MESH:D007674), infectious diseases (MESH:D003141), ARDS (MESH:D012128), death (MESH:D003643), neurodegenerative diseases (MESH:D019636), injury to (MESH:D014947), inflammation (MESH:D007249), pulmonary edema (MESH:D011654), INPROF (MESH:D011488), multiorgan failure (MESH:D051437), infected (MESH:D007239), COVID (MESH:D000086382), cancer (MESH:D009369)
- **Chemicals:** acid (MESH:D000143), amino (-), Amino-acid (MESH:D000596)
- **Species:** Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049], Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12937695/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12937695/full.md

## References

68 references — full list in the complete paper: https://tomesphere.com/paper/PMC12937695/full.md

---
Source: https://tomesphere.com/paper/PMC12937695