# An interpretable alphabet for local protein structure search based on amino acid neighborhoods

**Authors:** Saba Zerefa, Jesse Cool, Pramesh Singh, Samantha Petti

PMC · DOI: 10.1093/bioinformatics/btaf458 · Bioinformatics · 2025-08-23

## TL;DR

The paper introduces a new interpretable method for comparing protein structures using local amino acid neighborhoods, improving search performance when combined with existing methods.

## Contribution

The novel contribution is the creation of the '3Dn' structural alphabet, which enhances protein structure search when combined with other alphabets.

## Key findings

- The '3Dn' structural alphabet improves protein structure search performance when combined with Foldseek’s 3Di alphabet.
- The method ranks best among local search methods that do not use amino acid identity information.
- Software tools are provided to explore new alphabets and their combinations for protein structure search.

## Abstract

Recent advancements in protein structure prediction methods have vastly increased the size of databases of protein structures, necessitating fast methods for protein structure comparison. Search methods that find structurally similar proteins can be applied to find remote homologs, study the functional relationships among proteins, and aid in protein engineering tasks.

We design a “3Dn” structural alphabet that encodes the local neighborhoods around each amino acid in an interpretable way. In a search benchmark task, a combination of our alphabet and Foldseek’s 3Di alphabet, outperforms each alphabet individually and ranks best among local search methods that do not require amino acid identity information. We provide software tools that enable the exploration of novel alphabets and combinations of alphabets for protein structure search.

The code is freely available at https://github.com/spetti/structure_comparison and at Zenodo https://doi.org/10.5281/zenodo.15734371.

## Full-text entities

- **Chemicals:** amino acid (MESH:D000596)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12516309/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12516309/full.md

## References

34 references — full list in the complete paper: https://tomesphere.com/paper/PMC12516309/full.md

---
Source: https://tomesphere.com/paper/PMC12516309