# epLSAP-Align: a non-sequential protein structural alignment solver with entropy-regularized partial linear sum assignment problem formulation

**Authors:** Xuechen Zhang, Zhuoyang Chen, Junyu Li, Qiong Luo, Longjun Wu, Weichuan Yu

PMC · DOI: 10.1093/bioinformatics/btaf309 · Bioinformatics · 2025-05-20

## TL;DR

This paper introduces epLSAP-Align, a new method for aligning protein structures that improves accuracy and efficiency for non-sequential alignments.

## Contribution

The novel contribution is formulating non-sequential protein alignment as an entropy-regularized partial linear sum assignment problem and solving it efficiently.

## Key findings

- epLSAP-Align outperforms existing non-sequential alignment tools in terms of biological structure overlaps.
- epLSAP-TM is at least 22% faster than USalign2 under the same conditions.
- The method integrates well with existing frameworks like TM-align and MICAN.

## Abstract

The three-dimensional protein tertiary structure alignment is a fundamental problem that seeks insights into functions and evolution. Previous structure alignment algorithms have adopted the sequential assumption and used dynamic programming solvers. However, many distantly related structures exhibit non-sequential similarities, and non-sequential alignment tools are less efficient and accurate than sequential ones. In this paper, we formulate the non-sequential alignment as the Entropy-regularized Partial Linear Sum Assignment Problem (epLSAP) and propose a solver based on Sinkhorn algorithms, referred to as epLSAP-Align.

Compared with existing non-sequential alignment solvers, our epLSAP-Align can explicitly model the gap penalty, efficiently achieve global optimality and balance coverage and fidelity. We show that epLSAP-Align can be easily integrated into the existing frameworks, such as TM-align and MICAN, resulting in the non-sequential alignment tool epLSAP-TM and epLSAP-MICAN, respectively. Both epLSAP-TM and epLSAP-MICAN achieve better performance than the existing non-sequential alignment tools in terms of biologically meaningful structure overlaps on two sequential alignment test sets MALIDUP and MALISAM, and four non-sequential alignment test sets MALIDUP-ns, MALISAM-ns, 64-difficult-case and RIPC datasets. Also, compared with the most recent non-sequential alignment tool USalign2, our epLSAP-TM is at least 22% faster under the same setting.

Our source code is available at https://github.com/xzhangem/epLSAP-align.

## Full-text entities

- **Diseases:** NS (MESH:C580335)
- **Chemicals:** USalign2 (-)
- **Mutations:** C7015-23G

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12137893/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC12137893/full.md

## References

42 references — full list in the complete paper: https://tomesphere.com/paper/PMC12137893/full.md

---
Source: https://tomesphere.com/paper/PMC12137893