# An effective fragment-based dual conditional diffusion framework for molecular generation

**Authors:** Haotian Chen, Yiting Shen, Jichun Li, Weizhong Zhao

PMC · DOI: 10.1093/bib/bbaf727 · 2026-01-19

## TL;DR

This paper introduces a new method for generating molecules that better fits 3D protein structures while maintaining chemical validity and drug-like properties.

## Contribution

A dual conditional diffusion framework that separately models scaffold and R-group generation with structural and chemical constraints.

## Key findings

- FDC-Diff outperforms existing methods on multiple SBDD benchmarks in terms of chemical validity and spatial compatibility.
- The model generates pharmacologically relevant molecules by integrating curated reaction rules and physical-chemistry refinement.
- Decomposing molecule generation into scaffold and R-group stages improves both global topology and local interaction modeling.

## Abstract

Fragment-based molecular generation has emerged as a promising paradigm in structure-based drug design (SBDD), deriving effective compounds with advanced properties, including chemical validity, synthetic feasibility, pharmacological relevance, etc. However, existing approaches often struggle with generating molecules which can both conform to 3D structural constraints and retain chemical plausibility. This is largely due to the fact that prior works often treat scaffolds and R-groups of molecules indiscriminately, overlooking the distinct semantic roles played by scaffolds and R-groups. Specifically, the scaffold serves as the rigid structural backbone that determines the global geometric topology and binding pose, whereas R-groups act as functional substituents responsible for fine-tuning local physicochemical interactions. Therefore, in this work, we propose fragment-based dual conditional diffusion (FDC-Diff), a novel dual conditional diffusion framework that integrates chemical priors and structural cues for fragment-based molecular generation. Unlike traditional de novo methods that generate atoms sequentially, FDC-Diff decomposes the molecule generation process into two semantically complementary stages. Given the protein pocket and an initial fragment, in the first stage, a spatially constrained scaffold is constructed to capture the global molecular topology. In the second stage, R-groups onto the obtained scaffold are elaborated to capture local semantics to further refine molecular properties. To ensure synthetic accessibility, initial fragments and scaffold-modification hierarchy are derived from curated reaction rules, and a physical-chemistry-inspired refinement step is applied to optimize final conformations. Experimental results on multiple SBDD benchmarks demonstrate that FDC-Diff achieves state-of-the-art performance in terms of comprehensive evaluations. Furthermore, our model excels at producing chemically valid, spatially compatible, and pharmacologically relevant molecules, suggesting its potential as a feasible tool for fragment-based drug design.

## Full-text entities

- **Genes:** BCHE (butyrylcholinesterase) [NCBI Gene 590] {aka BCHED, CHE1, CHE2, E1}, ASPM (assembly factor for spindle microtubules) [NCBI Gene 259266] {aka ASP, Calmbp1, MCPH5}, TTC41P (tetratricopeptide repeat domain 41, pseudogene) [NCBI Gene 253724] {aka GNN, GNNP}, ACHE (acetylcholinesterase (Yt blood group)) [NCBI Gene 43] {aka ACEE, ARACHE, N-ACHE, YT}
- **Diseases:** hallucinations (MESH:D006212), syncope (MESH:D013575), neurodegenerative disorder (MESH:D019636), gastrointestinal disturbances (MESH:D005767), AD (MESH:D000544), neuropsychiatric symptoms (MESH:D001523), anxiety (MESH:D001007), cytotoxicity (MESH:D064420), impaired daily functioning (MESH:D020773), urinary tract infections (MESH:D014552), nausea (MESH:D009325), diarrhea (MESH:D003967), aggression (MESH:D010554), bradycardia (MESH:D001919), memory loss (MESH:D008569), cognitive decline (MESH:D003072)
- **Chemicals:** -methylene--butyrolactone (-), gatifloxacin (MESH:D000077734), O (MESH:D010100), N (MESH:D009584), norfloxacin (MESH:D009643), carbon (MESH:D002244), Quinolone (MESH:D015363), eriolangin (MESH:C504937), ciprofloxacin (MESH:D002939), gemifloxacin (MESH:D000077735), fluorine (MESH:D005461), arglabin (MESH:C083346), octanol (MESH:D000442), nalidixic acid (MESH:D009268), water (MESH:D014867), levofloxacin (MESH:D064704), acetylcholine (MESH:D000109), moxifloxacin (MESH:D000077266), Hydrogen (MESH:D006859), helenalin (MESH:C001329)
- **Cell lines:** FLAG — Homo sapiens (Human), Induced pluripotent stem cell (CVCL_C0IU)

## Figures

18 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12814976/full.md

---
Source: https://tomesphere.com/paper/PMC12814976