GeoGAD: geometry-aware antibody design framework for complementarity-determining region precision engineering
Songjian Wei, Jinxiong Zhang, Yan Chen, Chunyan Tang, Jiayang Tan

TL;DR
GeoGAD is a new framework for designing antibodies that improves precision by better modeling the geometry of key regions involved in antigen binding.
Contribution
GeoGAD introduces a geometry-aware design framework with Gaussian attention mechanisms for improved antibody CDR modeling.
Findings
GeoGAD outperforms or matches state-of-the-art models in antibody sequence–structure co-modeling and CDR design.
The framework excels in amino acid recovery rates and structural accuracy metrics like RMSD and TM-score.
GeoGAD effectively models long-range dependencies while preserving local residue focus through its Gaussian attention mechanism.
Abstract
Antibodies, as pivotal effector molecules of the immune system, neutralize pathogens through specific binding to antigens mediated by complementarity-determining regions (CDRs), highlighting the critical importance of precise antibody design in diagnostics and therapeutics. Despite significant advances in CDR design, current methods remain limited by inadequate modeling of geometric constraints, omission of multi-scale spatial relationships, and insufficient conformational representation capacity—factors that collectively degrade prediction accuracy. To overcome these limitations, we present GeoGAD, a geometry-aware antibody design framework with Gaussian attention mechanisms. Key innovations include: (1) the introduction of rotational positional encoding to enhance geometric sensitivity; (2) a geometry-aware module that integrates multi-scale spatial features through dynamic message…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6| Model | CDR-H1 | CDR-H2 | CDR-H3 | |||
|---|---|---|---|---|---|---|
| AAR↑ |
| AAR↑ |
| AAR↑ |
| |
| DiffAb | 61.34 | 1.02 | 37.66 | 1.20 | 25.79 | 3.02 |
| AbDiffuser | 62.76 | 1.08 | 41.10 | 1.16 | 29.58 | 2.83 |
| MEAN | 58.29 | 0.98 | 41.15 | 0.95 | 36.38 | 2.21 |
| dyMEAN | 63.52 | 0.84 | 55.41 | 0.82 | 37.19 | 2.09 |
| ADesigner | 64.34 | 0.82 | 55.52 | 0.79 | 37.37 | 1.97 |
| RAAD |
|
|
|
|
|
|
| GeoGAD |
|
|
|
|
|
|
| Model | AAR↑ | TM-score↑ |
|
|---|---|---|---|
| RosettaAD | 22.50 | 0.9435 | 5.52 |
| RefineGNN | 29.79 | 0.8308 | 7.55 |
| C-RefineGNN | 28.90 | 0.8317 | 7.21 |
| HERN | 32.83 | 0.9684 | 3.12 |
| C-HERN | 34.51 | 0.9734 | 2.79 |
| AbEgDiffuser (t = 100) | 28.97 | 0.9802 | 2.99 |
| MEAN | 36.77 | 0.9812 | 1.81 |
| dyMEAN | 39.14 | 0.9825 | 1.66 |
| ADesigner | 40.94 | 0.9850 | 1.55 |
| RAAD | 41.26 |
| 1.46 |
| GeoGAD |
|
|
|
| Model |
|
|---|---|
| Random | +1.52 |
| LSTM | −1.48 |
| RefineGNN | −3.98 |
| C-RefineGNN | −3.79 |
| MEAN | −5.33 |
| dyMEAN | −7.31 |
| ADesigner | −10.78 |
| RAAD | −11.79 |
| GeoGAD | −12.41 |
| Modeling | Design | ||||
|---|---|---|---|---|---|
| AAR↑ |
| AAR↑ | TM-score↑ | Cα-RMSD↓ | |
| GeoGAD |
|
|
|
|
|
| w/o RoPE | 37.10 | 1.89 | 38.17 | 0.9831 | 1.50 |
| w/o Lennard-Jones potential | 37.22 | 1.85 | 38.90 | 0.9848 | 1.48 |
| w/o Multi-band positional encoder | 36.79 | 1.85 | 39.86 | 0.9847 | 1.51 |
| w/o Gaussian attention | 36.63 | 1.95 | 35.92 | 0.9805 | 1.55 |
| Modeling | Design | ||||
|---|---|---|---|---|---|
|
| AAR↑ |
| AAR↑ | TM-score↑ |
|
| 8 | 36.55 | 1.90 | 39.66 | 0.9828 | 1.53 |
| 16 | 36.62 | 1.91 | 40.07 | 0.9821 | 1.43 |
| 32 |
|
|
|
|
|
| 64 | 37.24 | 1.86 | 41.25 | 0.9833 | 1.51 |
| 128 | 36.47 | 1.91 | 40.94 | 0.9798 | 1.53 |
- —Guangxi Key Research and Development Plan10.13039/501100017691
- —National Natural Science Foundation of China10.13039/501100001809
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMonoclonal and Polyclonal Antibodies Research · vaccines and immunoinformatics approaches · Artificial Immune Systems Applications
1 Introduction
The primary goal of antibody design is the generation of therapeutic antibodies that bind target antigens with high affinity and specificity, thereby enabling effective neutralization or modulation of pathogens, cancer cells, or aberrant proteins. (Meganck and Baric 2021). This capability holds substantial promise for applications in infectious disease control, cancer immunotherapy, and autoimmune disorder intervention. In natural antibodies, precise recognition of antigenic epitopes is mediated by the complementarity-determining regions (CDRs) within the variable domains. Among the six CDRs, the heavy-chain CDR-H3 exhibits exceptional sequence and structural diversity and is widely regarded as the principal determinant of antigen-binding specificity (Maynard and Georgiou 2000, Akbar et al. 2021). Consequently, the accurate co-design of CDR-H3 sequence and three-dimensional conformation represents a central challenge in computational antibody engineering.
Antibodies are immunoglobulins utilized by the immune system to recognize and neutralize antigens (e.g., pathogens) (Murphy and Weaver 2016). Structurally, antibodies comprise two pairs of polypeptide chains—light and heavy chains—linked by disulfide bonds to form a characteristic Y-shaped architecture (Lu et al. 2020), as schematically depicted in Fig. 1. Each chain contains variable domains (VH for heavy chain; VL for light chain) followed by constant regions, with the variable regions organized into four framework regions (FRs) providing structural scaffolding and three CDRs (CDR-H1, CDR-H2, CDR-H3) responsible for antigen recognition.
Schematic representation of antibody-antigen complex structure. Antibodies are Y-shaped heterotetrameric proteins comprising two identical heavy chains and two identical light chains.
Fundamentally, antibody design constitutes a specialized form of function-driven protein engineering. When constrained to fixed backbone architectures, global sequence optimization based on conditional probability distributions remains a central challenge in protein design (Xiong et al. 2020, Chu et al. 2024). In a broader protein design context, several approaches have sought to bridge this gap. DenseCPD (Qi and Zhang 2020) predicted amino acid probabilities for each residue by modeling the 3D density distribution of backbone atoms. ProDESIGN-LE (Huang et al. 2023) iteratively refined residue compatibility by learning geometric-chemical features within a Transformer framework. ProteinMPNN (Dauparas et al. 2022) implemented autoregressive sequence generation via masked self-attention mechanisms on topological backbone features. RFdiffusion (Watson et al. 2023) generated protein structures via a reverse denoising process grounded in diffusion model principles. These methods establish residue-structure correlations through geometric topological feature extraction, enabling deep learning-driven design. Sequence-based methods offer the advantage of leveraging abundant sequence data, which mitigates attribution biases from structural feature engineering (Sinai et al. 2017) and enables direct sequence-function mapping. Transformer language models such as ESM-1b (Rives et al. 2021) and ProGen (Madani et al. 2023) demonstrate sequence-to-function inference capabilities through unsupervised learning of residue dependencies in large-scale datasets. However, these kinds of models encounter functional-structural decoupling challenges: generated sequences may lack explicit structural constraints, resulting in unstable folds or infeasible conformations (Jeon and Kim 2024).
Conventional antibody engineering techniques, including hybridoma-based monoclonal antibody production and phage display selection, rely on experimental screening and iterative optimization. These methods are not only time- and labor-intensive but also constrained by the combinatorial complexity of the antibody sequence-structure space. While computational methodologies such as homology modeling and molecular docking have partially mitigated these challenges, they remain limited in modeling highly variable regions like CDRs and capturing essential binding conformations for antigen recognition.
Recent advances in deep learning have transformed computational antibody design through data-driven exploration of sequence-structure-function relationships. Structure-based generation approaches (e.g., ABlooper [Abanades et al. 2022], Ig-VAE [Eguchi et al. 2022]) concentrate on CDRs modeling by predicting backbone atomic 3D coordinates. However, these methods exhibit critical limitations in globally co-optimized structural and functional properties. Their reliance on local conformational sampling fails to capture framework (FRs) and antigen epitopes, and they cannot concurrently optimize sequence affinity and developability.(Cheng et al. 2024) Transformer-based antibody language models (AntiBERTy [Ruffolo et al. 2021], AbLang [Olsen et al. 2022], IgLM [Shuai et al. 2023]) enable end-to-end sequence generation from epitopes, overcoming limitations of inverse folding. Nevertheless, such sequence-only frameworks fail to explicitly encode 3D geometric information, resulting in questionable foldability of the generated antibody sequences (Gallo 2024). E(3)-equivariant graph neural networks, deep generative models, and diffusion-based frameworks—have transformed computational antibody design through data-driven exploration of sequence-structure-function relationships. RefineGNN (Jin et al. 2022b) introduced co-design of sequences and 3D structures using geometric graph representations. Subsequent models HSRN (Jin et al. 2022a) further enhanced the accuracy and efficiency of antibody design and docking by integrating hierarchical equivariant modeling and iterative structural optimization. DiffAb (Luo et al. 2022) employed denoising diffusion probabilistic models for CDR design. AbEgDiffuser (Zhu et al. 2025) integrated bilevel equivariant graph neural networks with evolutionary constraints to guide the diffusion-based generation of antibody CDRs. MEAN (Kong et al. 2023a) captured inter-chain spatial interactions via E(3)-equivariant message passing, and dyMEAN (Kong et al. 2023b) enabled end-to-end antibody generation through structure initialization and shadow paratope engineering. GeoAB (Lin et al. 2024) introduced geometric initializer and refiner units to generate CDRs with biophysically realistic internal geometries. ADesigner (Tan et al. 2024) implemented cross-gate MLP for antibody sequence-structure co-modeling. RAAD (Wu et al. 2025) incorporated multimodal features for relation-aware antibody design. Despite these advances, three fundamental limitations remain. (i) Insufficient geometric sensitivity hinders accurate modeling. Antibody function critically depends on precise alignment between secondary structural elements (β-sheets, α-helices) and 3D spatial topology (Lin et al. 2024). However, conventional static positional encodings (Jin et al. 2022b; Kong et al. 2023a) fail to preserve SE(3)-equivariant representations during structural modeling, leading to physically infeasible predictions (Si and Yan 2024) that compromise stability. (ii) Geometric information decay occurs during long-sequence modeling. Previous work (Jeon and Kim 2024) demonstrates that geometric details degrade across message-passing layers, particularly in long CDR regions, preventing accurate structural reconstruction and impairing design precision. (iii) Inadequate multi-scale relationship integration limits model expressiveness. Antibody design requires simultaneous consideration of atomic-, residue-, and domain-level interactions, yet existing single-scale approaches (Luo et al. 2022, Kong et al. 2023b) neglect cross-scale dependencies that govern structural integrity.
To overcome these limitations, we propose GeoGAD (Geometry-aware Gaussian Attention-based Antibody Designer), a novel framework for structure-aware antibody design. GeoGAD incorporates rotary position embedding (RoPE) (Su et al. 2024) to capture relative sequential dependencies. Unlike absolute positional encodings, RoPE relies solely on topological indices, thereby addressing the SE(3)-equivariance preservation issue inherent in static representations. To simultaneously enhance geometric sensitivity and capture hierarchical geometric context, we introduce a geometry-aware module that fuses information across atomic, residue, and domain scales through dynamic message passing, adaptive edge feature updating, and multi-edge-type cooperative coordinate refinement. Additionally, we design an edge-type-sensitive Gaussian attention mechanism that leverages a spatial Gaussian kernel to concentrate attention on local critical residues while maintaining the capacity to model long-range dependencies—effectively mitigating geometric signal degradation in deep architectures. Taking both 1D sequences and 3D structures of antibody–antigen complexes as input, GeoGAD enables end-to-end joint generation of antibody sequences and conformations through the synergistic integration of geometric reasoning and Gaussian attention. Extensive experiments show that GeoGAD achieves state-of-the-art performance across three critical tasks: antibody sequence-structure co-modeling, CDR design, and affinity optimization, validating its effectiveness in addressing longstanding challenges in computational antibody design.
2 Materials and methods
2.1 Task formulation
An antibody-antigen complex consists of amino acid residues, with the -th position residue defined as , where and . Here, different letters correspond to distinct amino acid types. Each amino acid is characterized by the 3D coordinates of its four backbone atoms . Let denote the coordinate of backbone atom of -th position residue , 3D structure of an amino acid residue is characterized as a coordinate matrix , namely . Given an antibody-antigen complex , we define its CDRs as , where and represent the starting position and length of CDRs, respectively. Antibody design aims to establish a deterministic mapping . In this task, our proposed framework incorporates geometric information derived from antibody structural statistics to design CDRs with enhanced physical plausibility.
2.2 Overview
We propose a joint antibody sequence-structure co-design framework GeoGAD (Geometry-aware Gaussian Attention-based Antibody Designer), which receives 3D structural coordinates of antibody-antigen complexes and 1D sequences of antibody and antigen as dual-input, integrating multi-scale geometric constraints through progressive hierarchical modeling. The architecture of GeoGAD comprises three functional modules: (i) a geometric feature encoder that extracts sequence and structural features, (ii) a geometric-aware module with 4 layers of message-passing neural networks (MPNNs) incorporating Gaussian attention mechanisms for feature updating, and (iii) a collaborative decoder that simultaneously predicts categorical residue distributions and regresses 3D structural coordinates in a non-autoregressive manner. Fig. 2 presents the complete workflow.
Schematic workflow of the Geometry-aware Gaussian Attention-based Antibody Designer (GeoGAD). The framework receives dual-modal inputs: antibody-antigen complex sequences and structures, with CDR-H3 regions masked using dedicated mask tokens. The pipeline of GeoGAD proceeds through three stages: (A) Geometric feature encoding. Antibody/antigen sequences are embedded into node features and fused with geometrically annotated heterogeneous graphs to construct initial representations. (B) Geometry-Aware Gaussian Attention-based Framework. The initial representations are iteratively updated through four layers of message-passing neural networks (MPNNs) incorporating Gaussian attention mechanisms. (C) Cooperative Decoding. Leveraging the refined features, the Cooperative decoder synchronously predicts the probability distribution over amino acid types and the atomic coordinates for the masked CDR-H3 region, to generate sequences and structures by sampling from these distributions.
2.3 Antibody-antigen complex heterogeneous graph representation and feature extraction
We formulate antibody-antigen complexes as heterogeneous graph with node set , where represent heavy chain, light chain, and antigen residues respectively. Each node is represented by a geometric feature vector and a coordinate matrix encoding the 3D positions of its four backbone atoms . Following Lin et al. (Lin et al. 2024), the edge set comprises 3 heterogeneous edge types defined by spatial proximity, sequential continuity, and residue-residue correlations which collectively characterize the structural dependencies between residues and . To aggregate global context features, we introduce 3 global nodes for heavy chain, light chain and antigen chain respectively. These global nodes are fully connected with each other and linked to all other nodes within their respective chains.
To enhance contextual feature representation for CDRs, we define the following three heterogeneous edge types in graph : (i) Intra-chain Context Edge, which captures local dependencies using geometric constraints. These are categorized as: Spatial proximity edges (connecting residues with radial distance 8 ), Sequence adjacency edges (connecting residues with sequential offsets of ±1 and ±2), and K-nearest neighbor edges (connecting each residue to its 8 spatially closest neighbors). (ii) Global Edge, which facilitates structural information aggregation across chain components. These include Global-local edges (connecting each global node to all residue nodes within its corresponding chain) and Global-global edges (connecting global nodes of different chains). (iii) Antibody-Antigen Interface Edges, which model binding interface interactions between antibody and antigen chains. These are defined as: Radial interface edges (connecting residues between antibody and antigen chains with radial distance < 12 ), and k-nearest interface edges (connecting each antibody residue to its *K *= 8 nearest antigen neighbors). Specifically, for an edge , we construct a comprehensive edge feature vector by concatenating geometric and topological descriptors. is composed of: (i) a one-hot encoding vector indicating the specific edge category; (ii) a relative positional embedding derived from the sinusoidal encoding of the sequential distance ; (iii) a distance feature vector consisting of Radial Basis Function (RBF) encoded Euclidean distances between the atom of the target node and the four backbone atoms of the source node ; (iv) an orientation feature represented by 4-dimensional unit quaternions derived from the relative rotation matrix between the local reference frames of the two residues, and (v) a direction feature vector containing the geometric displacement vectors from the target node’s atom to the backbone atoms of the source node, projected into the target residue’s local reference frame to ensure rotational invariance.
For given node , its initial feature is defined as:
where denotes the masked residue type embedding; is the Rotary Position Embedding (RoPE), computed from the local position index of the residue within its polypeptide chain:
where . This formulation effectively captures relative sequential dependencies via rotation in the feature space. Since the embedding relies exclusively on sequence indices independent of the global 3D coordinate system, it preserves the SE(3)-invariance of the overall node representation, ensuring consistency under 3D rigid transformations. Complementing this topological encoding, the explicit 3D geometric information is incorporated through invariant physical descriptors computed from the backbone structure: denotes the set of three backbone interatomic distances: , , and , which are mapped to continuous features via a radial basis function ; consists of sine and cosine projections of backbone dihedral angles ( ) and bond angles ( ); To compute the orientation feature , we construct a local reference frame for each residue. We define the x-axis unit vector along the bond direction, and the z-axis as the normalized cross product of the and vector, with the y-axis determined by the right-hand rule. then represents the coordinates of the backbone atom displacement vectors projected into this local frame.
Building upon this heterogeneous graph representation, we develop a geometry-aware Gaussian attention framework to enable accurate joint sequence–structure design of antibody CDR regions.
2.4 Geometry-aware Gaussian attention antibody designer
We present a novel Geometry-aware Gaussian attention framework for antibody design, comprising a geometry-aware module (Fig. 3A) and a Gaussian attention module (Fig. 3B).
GeoGAD architecture. (A) Geometry-Aware Module: dynamically models antibody 3D structures by fusing multi-scale geometric features under physical constraints. The module comprises three components: Dynamic message passing with geometric priors, Adaptive edge feature updates, and Iterative coordinate refinement. This module integrates multi-band positional encoding and Lennard-Jones potential to enhance geometric awareness while ensuring physical plausibility. (B) Gaussian Attention Module: implements spatially accurate interaction modeling through edge-type-specific Gaussian kernels. The module operates in three stages: distance bias calculation using radial basis encoding functions (RBF), attention score generation, and context-aware node feature updating.
2.5 Geometry-aware module
The Geometry-Aware Module is designed to fuse multi-scale geometric context centered on individual residues. Specifically, the model encodes detailed geometric descriptors at the backbone atom level through high-dimensional embeddings, including interatomic relative positions, bond lengths, backbone dihedral angles, bond angles, and direction cosine vectors defined in a local reference frame. To capture diverse structural dependencies, the model employs heterogeneous edge types to explicitly represent sequential adjacency, spatial proximity, and intra- or inter-molecular interactions (e.g., within or between heavy chain, light chain, and antigen). Furthermore, to incorporate molecular-level context, global nodes are introduced for the heavy chain, light chain, and antigen, respectively, and chain-level information is aggregated via global–local attention or pooling mechanisms.
These multi-scale geometric signals are dynamically integrated through a multi-edge-type cooperative coordinate optimization mechanism. Each edge type is processed by a dedicated transformation network that produces scaling factors to differentially weigh the contributions of local geometric constraints, long-range spatial interactions, and molecular-entity guidance signals (see Equation (8)). This adaptive weighting enables unified modeling of local structural validity, inter-residue spatial compatibility, and global conformational consistency. The module operates in three sequential phases: (i) Dynamic geometric message passing, (ii) Adaptive edge feature updating, and (iii) Coordinate refinement.
Dynamic geometric message passing facilitates precise evolution of antibody 3D conformations via multi-layer geometric representation fusion and joint sequence-structure modeling. For residue pair within local neighborhoods, we define the geometric features matrix to capture multi-channel spatial interactions. Specifically, let denote the coordinate difference matrix for the four backbone atoms between residues and . We compute the inner products of these difference vectors to generate the geometric Gram matrix . These features are encoded into low-dimensional geometric descriptors through a differentiable mapping for efficient representation:
where denotes the vectorization.
This mechanism captures statistical properties of spatial distributions by preserving second-order moment information (covariance matrices) to enforce robustness to structural variations. Inspired by Mildenhall et al. (2022), we propose a multi-frequency positional encoder to capture spatial relationships across scales. Specifically, we define a set of log-spaced frequency bases , and for each residue pair , embed the relative displacement vector into a multi-scale geometric representation by element-wise multiplication with each , followed by sine and cosine mappings:
where denotes concatenation, represents element-wise multiplication. The frequency parameters , are uniformly spaced on a logarithmic scale over the interval This multi-frequency design enables the model to simultaneously capture fine-grained local geometry (via high-frequency components) and long-range spatial interactions (via low-frequency components), thereby mitigating geometric signal attenuation in deep network layers. Ablation studies (see Table 5) show that setting yields the best trade-off between amino acid recovery rate (AAR) and structural accuracy (RMSD). Finally, the enhanced geometric features are fed into the message generation network together with node features and edge-type encodings to drive geometry-aware message passing. For a given edge the message vector is computed via fusion of three feature components:
where denotes MLP acting on the feature pair, represents edge features, and constitutes a four-layer message generation network.
Adaptive edge updating incorporates molecular dynamic priors by combining the Lennard-Jones potential (Lennard-Jones 1931) function with original edge features. Specifically, to capture van der Waals interactions, we explicitly calculate the potential based on the Euclidean distance between the atoms of residue pairs:
where and are learnable parameters representing the potential depth and the zero-crossing distance, respectively. The calculated potential is then normalized to ensure numerical stability. The edge feature updating integrates three types of heterogeneous information, including node hidden states and , edge features , and Lennard-Jones potential , and then applies the multilayer perceptron to perform feature updates:
By integrating topological relationships and physical constraints into edge features, this design provides energy-guided optimization for coordinate refinement.
Coordinate updating employs multi-edge-type collaboration that enables adaptive modeling of geometric constraints via edge feature-based adaptive weighting. For node adjacent to -th edge type, type-specific transformation network maps the edge feature to a scaling factor , which is then broadcast across dimensions and element-wise multiplied with the coordinate difference vector to generate the edge-type-sensitive geometric displacement vector . Then an unordered segment-wise mean pooling strategy is exploited to integrate diverse edge type contributions. The final node coordinate update is formulated in a residual manner to enable iterative optimization, ensuring geometric plausibility of generated structures:
where denotes the coordinate of node at the -th iteration layer, represents the total number of edge types for node , denotes the neighborhood associated with the -th edge type for node , represents the cardinality of the total neighborhood of node , and is the edge-type-specific coordinate transformation network.
To further enhance the model’s ability to jointly capture local critical interactions and long-range dependencies, we introduce a Gaussian attention mechanism built upon the geometry-aware module.
2.6 Gaussian attention module
The Gaussian attention mechanism enables precise modeling of critical residues in antibody structures through consideration of spatial distance constraints and feature similarity metrics. In essence, antibody-antigen interactions are a highly specific recognition process, where the free energy contribution of antigen epitopes binding to CDRs is significantly higher than that to intra-chain other intra-chain regions. The proposed geometric-aware Gaussian attention mechanism employs an edge-type-sensitive weighting strategy via adaptive Gaussian bias terms to enhance focus on critical information.
The spatial Gaussian kernel incorporates a covariance matrix-based Gaussian bias term to model local interaction patterns in antibody 3D structures. For node pairs with edge type , their geometric features are defined as:
where denotes a learnable parameter that dynamically modulates the effective receptive field of the Gaussian kernel. The differentiable Gaussian kernel adaptively tunes the spatial receptive fields of distinct CDRs via backpropagation. It balances long-range structural coherence modeling with selective emphasis on local critical residues, thereby enhancing the discriminative capability for contextual structural patterns. Figure 4 presents the attention distribution heatmap of k-nearest neighbor edges and spatial sequential adjacency edges, using the heavy chain of antibody-antigen complex (PDB: 8dl6) as a case study. The results demonstrate preferential attention weights on local residues (e.g., adjacent or spatial neighbors), while non-local residue pairs with remote sequence distance also exhibit increased attention scores, confirming successful modeling of non-local spatial interactions. This observation validates the Gaussian attention mechanism’s ability to harmonize local residue prioritization with global structural coherence.
Visualization of edge attention distribution in antigen-antibody complex (PDB: 8dl6) heavy chain.
The node dynamic feature updating is based on the orthogonal projection mechanism, aiming to enhance the synergistic expression capability between geometric features and sequence features. Attention scores are computed via bilinear projection:
where denotes the edge-type-specific projection matrix and is the flattened covariance matrix concatenated with node and edge feature . After computing the Gaussian attention scores, messages from the -th edge type, denoted , are aggregated using the corresponding attention weights as follows:
The aggregated message is then concatenated with the node feature , transformed by a MLP, and added residually to produce the updated representation:
where denotes the MLP.
2.7 Antigen-biding CDR-H3 co-decoding
Our framework enables end-to-end joint sequence-structure generation in a single forward pass via co-decoding, eliminating iterative refinement or autoregressive steps.
Sequence amino acid prediction outputs the probability distribution of amino acid types based on the updated node features :
where denotes the node features of the -th masked residue, and and represent the classifier weight matrix and bias vector, respectively. This prediction mechanism relies solely on high-order node representations from the geometry-aware module without explicit concatenation of coordinate features, which reflects deep integration of sequence and structural information.
Atomic coordinate refinement is performed via residual updates for iterative optimization, where geometric displacement vectors for the four backbone atoms ( ) of residue are generated by the transformation network :
where encodes relative position information of node coordinates according to Equation (7). This design introduces minimal perturbations at each update step through multi-scale geometric information fusion, thereby maintaining the physical plausibility of geometric constraints such as bond angles and dihedral angles.
2.8 Loss function
The loss function comprises three distinct components: sequence reconstruction loss, coordinate reconstruction loss, and contrastive loss.
The sequence reconstruction loss ( ) is formulated as the cross-entropy criterion:
where denotes the set of masked amino acid positions and represents the true residue type.
The coordinate reconstruction loss ( ) is defined using the smooth L1 loss to quantify the deviation between predicted and ground-truth backbone atom coordinates:
where and denote the predicted and ground-truth coordinates, respectively, and ensures gradient smoothness.
In addition, we introduce a contrastive loss ( ) to encourage alignment between CDR and antigen epitope representations. For each antibody–antigen complex, we compute the mean residue embeddings over the CDR regions and the antigen epitope, respectively, and project them through a lightweight MLP head. The contrastive loss is then formulated as:
where and represent the CDRs features and corresponding antigen epitope features, respectively, while negative samples are sampled from other antigens within the same batch, with denoting the temperature coefficient.
The total loss function balances these three terms via a weighted summation:
where and modulate the contributions of structural coordinate reconstruction and contrastive learning, respectively, and are optimized through validation set calibration.
3 Experiments
We adopt the experimental settings following prior works (Adolf-Bryfogle et al. 2018) and evaluate our model through three experimental scenarios: (i) Sequence-structure co-modeling on the Rosetta Antibody Design Database (Adolf-Bryfogle et al. 2018); (ii) Antigen-binding CDR-H3 design; (iii) Antibody affinity optimization.
3.1 Baselines
We compare against state-of-the-art antibody design methods as baselines, including: (i) LSTM-based antibody design approach (Saka et al. 2021), generating sequences in an autoregressive fashion; (ii) RefineGNN (Jin et al. 2022a), focusing on heavy chain modeling with autoregressive residue generation; (iii) C-RefineGNN (Kong et al. 2023a), extending RefineGNN to model the entire antibody-antigen complex; (iv) DiffAb (Luo et al. 2022), a diffusion probabilistic model for antibody structure design; (v) MEAN (Kong et al. 2023a), employing E(3)-equivariant message passing mechanism to capture interactions within antibody-antigen complexes; (vi) dyMEAN (Kong et al. 2023b), featuring an adaptive multi-channel equivariant encoder for full-atom geometric feature extraction; (vii) ADesigner (Tan et al. 2024), utilizing a cross-gated MLP for sequence-structure co-learning; (viii) RAAD (Wu et al. 2025), performing relation-aware antibody design in a single forward pass through comprehensive node/edge feature integration; and (ix) AbEgDiffuser (Zhu et al. 2025), integrating bilevel equivariant graph neural networks with evolutionary constraints for diffusion-based antibody codesign.
3.2 Metrics
For sequence-structure co-modeling on the antibody structure database, we quantify sequence recovery using amino acid recovery rate (AAR) coupled with root mean square deviation (RMSD) to assess local conformational accuracy at the atomic level. In the antigen-binding CDR-H3 design task, we additionally employ TM-score (Zhang and Skolnick 2004, Xu and Zhang 2010) to evaluate global structural topology alignment of antibody-antigen complexes alongside AAR and RMSD metrics. For antibody affinity optimization, we utilize binding energy (ΔΔG) as the primary metric, which assesses the model’s capability in enhancing antibody affinity by quantifying thermodynamic changes in binding interactions.
3.3 Sequence and structure modeling
We evaluate the sequence-structure co-modeling capability of our framework on the Structured Antibody Database (SAbDab). The database comprises 9108 antibody structures (current through August 2024). After excluding entries lacking antigen structures, we clustered the remaining 3,127 antibody-antigen complexes using MMSeqs2 following Jin et al.'s (Jin et al. 2022a) protocol, grouping antibodies with >40% CDR sequence identity into the same cluster. This yielded 765, 1093, and 1659 clusters for CDR-H1, CDR-H2, and CDR-H3, respectively. To assess model performance, we partitioned the clustered dataset into training, test, and validation sets with an 8:1:1 ratio. The experiment employed 10-fold cross-validation, with results presented in Table 1. GeoGAD demonstrates superior performance across all CDR regions in the SAbDab benchmark. In terms of sequence recovery (AAR), GeoGAD achieves values of 68.31% for CDR-H1, 61.00% for CDR-H2, and 37.83% for CDR-H3, significantly outperforming all baseline models. GeoGAD also excels in structural accuracy ( -RMSD), reducing RMSD by 0.05 Å (CDR-H1), 0.03 Å (CDR-H2), and 0.08 Å (CDR-H3) relative to the second-best model. Crucially, regarding structural accuracy, GeoGAD consistently outperforms the leading baseline, RAAD, achieving reductions of 0.05 Å, 0.03 Å, and 0.09 Å across the CDR-H1, H2, and H3 regions, respectively. This improvement validates our geometry-aware design strategy. Unlike RAAD, which depends on learned edge relations to infer coordinates, GeoGAD explicitly integrates RoPE and biophysical constraints via the Lennard-Jones potential. These mechanisms enforce stricter geometric consistency and physical plausibility, enabling GeoGAD to recover precise backbone conformations that pure relation-based approaches may miss.
3.4 Antigen-binding CDR-H3 design
We evaluate the model’s capacity for generating antigen-binding CDR-H3 regions using the Rosetta Antibody Design (RAbD) dataset, which comprises 60 antigen-antibody complexes. The SAbDab dataset serves as the training set, excluding antibodies with CDR-H3 regions clustered identically to RAbD entries to prevent data leakage. The remaining data is partitioned into training and test sets at a 9:1 ratio. We include RosettaAD (Adolf-Bryfogle et al. 2018), a conventional benchmark method, for comparison alongside contemporary approaches. As shown in Table 2, our model achieves the highest AAR (41.62%) and the lowest Cα-RMSD (1.40 Å) among all evaluated methods. Although RAAD retains a marginal 0.21% lead in TM-score, GeoGAD surpasses it in structural accuracy with a 0.06 Å reduction in RMSD. This improvement highlights the model’s capacity for fine-grained atomic refinement. The high AAR suggests that our multi-band positional encoding effectively captures the long-range sequence dependencies needed for accurate residue identification. Furthermore, the minimized RMSD confirms that Rotational Position Embeddings (RoPE) and Gaussian attention successfully enforce strict SE(3)-equivariant constraints, focusing the model on critical local interactions. Together, these components prioritize biophysical plausibility over broad topological approximations, rendering GeoGAD highly effective for therapeutic antibody design. Figure 5 presents comparative structural modeling of CDR-H3 regions for two randomly sampled antigen-antibody complexes (PDB: 2DD8 and PDB: 3CX5).
Comparative structural analysis of CDR-H3 regions for two antigen-antibody complexes (PDB: 2DD8, PDB: 3CX5). Generated structures from GeoGAD, RAAD, and ADesigner are superimposed with native structures, accompanied by corresponding RMSD metrics.
3.5 Antibody affinity optimization
Antibody affinity optimization represents a primary objective in therapeutic antibody design. To evaluate the model’s performance in this task, we curated a test dataset comprising 53 high-quality antibody-antigen complex structures selected from the SKEMPI v2.0 database (Jankauskaitė et al. 2019). These complexes cover a diverse range of antigen types and binding interfaces, providing a robust benchmark for optimization. We evaluate for optimized CDR-H3 sequences and structures. Following Kong et al.'s protocol (Kong et al. 2023a), binding energy predictions were performed using a geometric deep learning network (Shan et al. 2022). The iterative target enhancement algorithm (Yang et al. 2020) was employed as the optimization strategy. As presented in Table 3, our framework achieves state-of-the-art performance, with optimized CDR-H3 sequences and structures exhibiting significantly reduced values, indicating enhanced binding affinity. Figure 6 visualizes the structural superposition between optimized and native conformations for two representative antibody-antigen complexes (PDB: 3LZF, optimized : -14.077; PDB: 3SE9, optimized : -12.401).
Structural superposition of affinity-optimized and native CDR-H3 regions for two representative antibody-antigen complexes.
3.6 Ablation study
To systematically assess the contribution of each key component in GeoGAD, we perform ablation studies on two core tasks: Sequence and Structure Modeling and Antigen-Binding CDR-H3 Design. The ablation configurations are as follows: (i) replace rotary position encoding (RoPE) with standard sinusoidal positional encoding; (ii) remove the Lennard-Jones potential term, retaining only the base geometric edge features; (iii) disable the multi-band positional encoder during message updating; (iv) disabling of the Gaussian attention mechanism.
Table 4 demonstrates that, in the modeling task, ablating the Gaussian attention mechanism leads to a significant increase in RMSD, highlighting its critical role in recovering fine-grained local geometry. The removal of the multi-band positional encoder causes the largest drop in amino acid recovery rate (AAR), underscoring its effectiveness in capturing long-range sequential dependencies. In the design task, both the multi-band positional encoder and Gaussian attention are essential: their removal results in substantial degradation in both AAR and TM-score. Notably, while RoPE has minimal impact on modeling performance, its absence in the generation task leads to markedly worse TM-score and higher RMSD, confirming its importance for ensuring the physical validity of generated antibody structures. Finally, although the Lennard-Jones potential does not produce a dramatic performance gain, it consistently yields modest improvements across both tasks.
To systematically assess the influence of the number of frequency bands in the multi-band positional encoding on model performance, we perform an ablation study over . As defined in Equation (4), governs the resolution of geometric signal representation in the multi-frequency space: a small may fail to capture the full range of spatial relationships—from local bond angles to long-range spatial interactions—whereas an excessively large risks introducing redundant frequency components. We therefore evaluate on two core tasks, namely structure modeling and sequence design, with results reported in Table 5.
4 Conclusion
In this paper, we propose Geometry-aware Gaussian Attention-based Antibody Designer (GeoGAD), which demonstrates significant advantages in antibody sequence-structure co-design by incorporating rotational position encoding, multi-level geometric representation modules, and Gaussian attention mechanisms. Experimental results show GeoGAD outperforms or matches state-of-the-art baselines in core metrics, including sequence recovery rate (AAR) and structural accuracy (RMSD, TM-score). Limitations include: (i) Strong data dependency, where performance is constrained by available high-quality antibody-antigen complex structures; (ii) Need for wet-lab validation, as current evaluation relies exclusively on computational metrics.
Based on the research foundation and limitations, future research priorities can advance the following directions: (i) for the data dependency problem, integrating semi-supervised learning and adversarial generative strategies, and integrating large-scale unlabeled antibody sequences and limited high-precision structural data, to improve generalization ability in few-shot scenarios, reducing dependence on high-quality complex structures; (ii) expanding the framework’s geometric modeling capability to antigen epitopes, to adapt to design requirements in practical applications, such as responding to antigen conformational changes caused by pathogen variation or broad-spectrum neutralizing antibody design; (iii) establishing a feedback mechanism combining computational simulation with wet experimental validation, such as combining high-throughput screening platforms to perform in vitro crystal structure analysis of candidate antibodies, forming a data-model-experiment iterative optimization system.
Although the current research is still limited to computational metric validation, the proposed geometric perception paradigm provides a modeling approach with physical constraints for the antibody engineering field, providing a new technical pathway for the development of therapeutic antibodies.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Abanades B , Georges G, Bujotzek A et al A Blooper: fast accurate antibody CDR loop structure prediction with accuracy estimation. Bioinformatics 2022;38:1877–80.35099535 10.1093/bioinformatics/btac 016PMC 8963302 · doi ↗ · pubmed ↗
- 2Adolf-Bryfogle J , Kalyuzhniy O, Kubitz M et al Rosetta Antibody Design (R Ab D): a general framework for computational antibody design. P Lo S Comput Biol 2018;14:e 1006112.29702641 10.1371/journal.pcbi.1006112 PMC 5942852 · doi ↗ · pubmed ↗
- 3Akbar R , Robert PA, PavlovićM et al A compact vocabulary of paratope-epitope interactions enables predictability of antibody-antigen binding. Cell Rep 2021;34:108856.33730590 10.1016/j.celrep.2021.108856 · doi ↗ · pubmed ↗
- 4Cheng J , Liang T, Xie XQ et al A new era of antibody discovery: an in-depth review of AI-driven approaches. Drug Discov Today 2024;29:103984.38642702 10.1016/j.drudis.2024.103984 · doi ↗ · pubmed ↗
- 5Chu AE , Lu T, Huang PS. Sparks of function by de novo protein design. Nat Biotechnol 2024;42:203–15.38361073 10.1038/s 41587-024-02133-2PMC 11366440 · doi ↗ · pubmed ↗
- 6Dauparas J , Anishchenko I, Bennett N et al Robust deep learning–based protein sequence design using Protein MPNN. Science 2022;378:49–56.36108050 10.1126/science.add 2187 PMC 9997061 · doi ↗ · pubmed ↗
- 7Eguchi RR , Choe CA, Huang PS. Ig-VAE: generative modeling of protein structure by direct 3D coordinate generation. P Lo S Comput Biol 2022;18:e 1010271.35759518 10.1371/journal.pcbi.1010271 PMC 9269947 · doi ↗ · pubmed ↗
- 8Gallo E. The rise of big data: deep sequencing-driven computational methods are transforming the landscape of synthetic antibody design. J Biomed Sci 2024;31:29.38491519 10.1186/s 12929-024-01018-5PMC 10943851 · doi ↗ · pubmed ↗
