# Transmembrane Homodimers Interface Identification: Predicting Interface Residues in Alpha-Helical Transmembrane Protein Homodimers Using Sequential and Structural Features

**Authors:** Bander Almalki, Li Liao

PMC · DOI: 10.3390/ijms26094270 · International Journal of Molecular Sciences · 2025-04-30

## TL;DR

This paper introduces a machine learning method to identify interface residues in transmembrane protein dimers, outperforming existing methods by combining sequential and structural features.

## Contribution

A novel machine learning framework that integrates sequential and structural features to predict interface residues in transmembrane homodimers.

## Key findings

- The proposed method outperforms state-of-the-art methods by over three percentage points in F1 score.
- The model surpasses multimeric structure predictors like RoseTTAFold2 and AlphaFold2Multimer.
- Using fewer features, the method achieves superior performance through effective integration of structural and sequential data.

## Abstract

Most bitopic transmembrane proteins associate with one another through interface residues to form dimers, which facilitate or activate specific cellular functions. Therefore, accurately identifying interface residues in a given dimer is crucial for understanding its function and has been a challenging pursuit for many computational methods. These methods can be broadly categorized into two approaches: general-purpose ones for dimerization and specialized ones for interface residues. In this study, we develop a machine learning method that integrates both approaches by integrating sequential and structural features extracted from predicted structures and various domains. The results from cross-validation on a benchmark dataset show that our method, despite utilizing significantly fewer features, outperforms the state-of-the-art methods by more than three percentage points in performance, as measured by the F1 score. Furthermore, we evaluated the performance of the proposed model on a benchmark dataset as compared to the state-of-the-art multimeric structure predictors, including RoseTTAFold2, AlphaFold2Multimer, and PREDDIMER. The results show the superiority of the proposed model by outperforming all the other models, highlighting the effectiveness of integrating both structural and sequential features within the proposed framework.

## Full-text entities

- **Genes:** SLC2A1 (solute carrier family 2 member 1) [NCBI Gene 6513] {aka CSE, DYT17, DYT18, DYT9, EIG12, GLUT}, EDNRA (endothelin receptor type A) [NCBI Gene 1909] {aka ET-A, ETA, ETA-R, ETAR, ETRA, MFDA}
- **Diseases:** oncological diseases (MESH:D000072716), injury to (MESH:D014947)
- **Chemicals:** LIPid (MESH:D008055), TMH (-), amino acids (MESH:D000596), glycine (MESH:D005998)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12073085/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC12073085/full.md

## References

42 references — full list in the complete paper: https://tomesphere.com/paper/PMC12073085/full.md

---
Source: https://tomesphere.com/paper/PMC12073085