CAP: Commutative Algebra Prediction of Protein-Nucleic Acid Binding Affinities
Mushal Zia, Faisal Suwayyid, Yuta Hozumi, JunJie Wee, Hongsong Feng, Guo-Wei Wei

TL;DR
CAP introduces a novel method combining commutative algebra and advanced embeddings to accurately predict protein-nucleic acid binding affinities, achieving high accuracy, interpretability, and efficiency without relying on 3D structures.
Contribution
The paper presents CAP, a new approach integrating persistent Stanley-Reisner theory with sequence embeddings for improved binding affinity prediction.
Findings
CAP outperforms existing benchmarks in binding affinity prediction.
CAP maintains performance on new datasets, demonstrating robustness.
Method enables genome-scale analysis without 3D structural data.
Abstract
An accurate prediction of protein-nucleic acid binding affinity is vital for deciphering genomic processes, yet existing approaches often struggle in reconciling high accuracy with interpretability and computational efficiency. In this study, we introduce commutative algebra prediction (CAP), which couples persistent Stanley-Reisner theory with advanced sequence embedding for predicting protein-nucleic acid binding affinities. CAP encodes proteins through transformer-learned embeddings that retain long-range evolutionary context and represents DNA and RNA with -mer algebra embeddings derived from persistent facet ideals, which capture fine-scale nucleotide geometry. We demonstrate that CAP surpasses the SVSBI protein-nucleic acid benchmark and, in a further test, maintains reasonable performance on newly curated protein-RNA and protein-nucleic acid datasets. Leveraging only…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
