TL;DR
PRIME introduces a multiscale, physics-informed hierarchical framework for protein representation, integrating structural information across five levels to improve predictive performance on various benchmarks.
Contribution
It is the first to explicitly model hierarchical relationships across multiple physical structural levels in proteins for representation learning.
Findings
PRIME outperforms baseline methods on Fold Classification benchmarks.
Achieves 84.10% accuracy on Reaction Class prediction, surpassing existing models.
Each structural level provides unique, non-redundant information for protein tasks.
Abstract
Proteins are inherently multiscale physical systems whose functional properties emerge from coordinated structural organization across multiple spatial resolutions, ranging from atomic interactions to global fold topology. However, existing protein representation learning methods typically operate at a single structural level or treat different sources of structural information as parallel modalities, without explicitly modeling their hierarchical relationships. We introduce PRIME (Protein Representation via Physics-Informed Multiscale Equivariant Hierarchies), a unified framework that models proteins as a nested family of five physically grounded structural graphs spanning surface, atomic, residue, secondary-structure, and protein levels. Adjacent levels are connected through deterministic, physics-informed assignment operators, enabling bidirectional information exchange via bottom-up…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
