# Precision fragment addition: domain-specific DeepFrag2 models for smarter lead optimization

**Authors:** César R. García-Jacas, Harrison Green, Shayne D. Wierbowski, Jacob D. Durrant

PMC · DOI: 10.1039/d5dd00425j · Digital Discovery · 2026-02-25

## TL;DR

DeepFrag2 is an improved machine-learning tool that helps chemists optimize drug candidates by adding suitable molecular fragments.

## Contribution

The study introduces DeepFrag2, which enhances lead optimization accuracy by using domain-specific models trained on fragment properties and receptor classes.

## Key findings

- DeepFrag2 models trained on specific fragment properties improve prediction accuracy.
- Fine-tuning DeepFrag2 on specific receptor classes further boosts performance.
- DeepFrag2 is released as open-source software for free use.

## Abstract

This study introduces a series of machine-learning models based on DeepFrag, our previously published tool designed to guide small-molecule lead optimization through fragment addition. We demonstrate enhanced accuracy by training new DeepFrag models to predict optimizing fragments with specific sizes and chemical properties. Additionally, we show that DeepFrag accuracy improves when fine-tuned on specific receptor classes. These targeted models should prove valuable for medicinal chemists with predetermined insights into suitable molecular fragment characteristics (such as preferred size ranges, charge states, or aromaticity) or those conducting optimization campaigns against specific drug-target classes with many known ligands. To encourage adoption, we release DeepFrag2 under the open-source MIT license. Interested users can download DeepFrag2 free of charge without registration from https://durrantlab.com/deepfrag2/.

DeepFrag2, a machine-learning tool for lead optimization via fragment addition, is more accurate when trained on fragments with specific sizes, charge states, or aromaticity. Fine-tuning on specific receptor classes further boosts performance.

## Full-text entities

- **Genes:** BACE1 (beta-secretase 1) [NCBI Gene 23621] {aka ASP2, BACE, HSPC104}, CA2 (carbonic anhydrase 2) [NCBI Gene 760] {aka CA-II, CAC, CAII, Car2, HEL-76, HEL-S-282}, TOP1 (DNA topoisomerase I) [NCBI Gene 7150] {aka TOPI}
- **Diseases:** MOAD (MESH:C536496), toxicity (MESH:D064420)
- **Chemicals:** nitrogen (MESH:D009584), phosphonates (MESH:D063065), phosphorus (MESH:D010758), amine (MESH:D000588), GMP (MESH:C066524), sulfur (MESH:D013455), imines (MESH:D007097), guanidines (MESH:D006146), phosphates (MESH:D010710), sulfonamides (MESH:D013449), arginine (MESH:D001120), sulfonates (MESH:D000476), tetrazole (MESH:C045574), alcohol (MESH:D000438), GDP (MESH:D006153), amidines (MESH:D000578), thiol (MESH:D013438), carbon (MESH:D002244), cyclic-GMP (MESH:D006152), oxygen (MESH:D010100), DeepFrag2 (-), sulfates (MESH:D013431), hydrogen (MESH:D006859)
- **Species:** Homo sapiens (human, species) [taxon 9606], Human immunodeficiency virus 1 (no rank) [taxon 11676]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12935096/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12935096/full.md

## References

22 references — full list in the complete paper: https://tomesphere.com/paper/PMC12935096/full.md

---
Source: https://tomesphere.com/paper/PMC12935096