# XGBMUT: Predicting the Functional Impact of Missense Mutations Using an Extreme Gradient Boost Classifier

**Authors:** Gabriel Rodrigues Coutinho Pereira, Loiane Mendonça
Abrantes Da Conceição, Bárbara
de Azevedo Abrahim-Vieira, Carlos Rangel Rodrigues, Lucio Mendes Cabral, Ricardo Limongi
França Coelho, Joelma Freire De Mesquita

PMC · DOI: 10.1021/acsomega.4c10179 · ACS Omega · 2025-02-19

## TL;DR

XGBMut is a new tool that predicts the impact of genetic mutations using machine learning, making it faster and easier to analyze mutations without complex setups.

## Contribution

XGBMut is a novel machine learning algorithm for predicting missense mutation functionality with a standalone, user-friendly interface.

## Key findings

- XGBMut achieved high accuracy in classifying the functional impact of missense mutations.
- XGBMut outperformed ten existing prediction algorithms in performance evaluations.
- The tool is accessible without requiring web servers or third-party software.

## Abstract

Millions of new mutations have been discovered largely
due to advancements
in genome projects, but characterizing their effects through traditional
wet-lab experiments remains labor-intensive and time-consuming. Functional
prediction algorithms offer a solution by enabling the efficient screening
of mutations, thereby saving time and resources. The objective of
this study was to develop a competitive algorithm for predicting the
functional impact of missense mutations. A unified database and substitution
matrices containing predictor variables specifically for missense
mutations were initially constructed. Subsequently, values for the
predictor variables were collected from the training and test sets
derived from the ClinVar and HumsaVar databases. A series of supervised
machine learning classifiers were then trained, and their performance
was evaluated using the test set. The best-performing model was additionally
compared against ten currently available functional prediction algorithms.
The proposed algorithm, XGBMut, demonstrates exceptional accuracy
in classifying missense mutations while also exhibiting a competitive
performance. Additionally, a user-friendly graphical interface was
developed to enhance accessibility for professionals in various fields.
Unlike most existing methods, XGBMut eliminates the need for a web
server dependency and the installation of third-party software, making
it a more versatile tool for users.

## Full-text entities

- **Genes:** PON1 (paraoxonase 1) [NCBI Gene 5444] {aka ESA, MVCD5, PON}, HBB (hemoglobin subunit beta) [NCBI Gene 3043] {aka CD113t-C, ECYT6, beta-globin}
- **Diseases:** HbS (MESH:D000755), genetic and metabolic disorders (MESH:D030342), abnormal hemoglobin (MESH:D006445), rare diseases (MESH:D035583), cancer (MESH:D009369), hemoglobinopathy (MESH:D006453)
- **Chemicals:** amino acid (MESH:D000596), metal (MESH:D008670), oxygen (MESH:D010100)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Mutations:** E6 V

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11886911/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11886911/full.md

## References

61 references — full list in the complete paper: https://tomesphere.com/paper/PMC11886911/full.md

---
Source: https://tomesphere.com/paper/PMC11886911