# Machine Learning‐Guided Repositioning of a SARS‐CoV‐2‐Targeting Molecular Series as Cruzain Inhibitors

**Authors:** Rafael F. Lameiro, Luiz F. Barbosa, Evelin R. Cardoso, Beatriz Siqueira Ho, Felipe Cardoso Prado Martins, Bruna C. de Melo, Fabiana Rosini, Anwar Shamim, Priscila M. Souza, Wellington Falcão de Souza, Carlos A. Montanari

PMC · DOI: 10.1002/cmdc.202500630 · 2026-01-31

## TL;DR

A machine learning model repurposes a library of compounds originally designed for SARS-CoV-2 to find new inhibitors for cruzain, a key enzyme in Chagas disease.

## Contribution

Using ML to reposition a SARS-CoV-2-targeting compound library for cruzain inhibition, identifying novel P1 moieties and validating drug-like properties.

## Key findings

- High-affinity cruzain inhibitors with novel P1 moieties were identified from a SARS-CoV-2-targeted compound library.
- Selected inhibitors showed favorable enthalpic and entropic contributions to binding without highly lipophilic R-groups.
- The study demonstrates how global health compound libraries can be repurposed for neglected tropical diseases using ML.

## Abstract

Drug repurposing and repositioning are concepts that involve identifying alternative therapeutic uses for existing drug candidates or molecular series. During the COVID‐19 pandemic, hundreds of antivirals were developed, many of which remain unexplored for other diseases. Concurrently, machine learning (ML) has become a valuable tool in early drug discovery for screening the most promising compounds for a target. In this work, an ExtraTrees ML model is developed to predict inhibitory activity against cruzain, the main cysteine protease of Trypanosoma cruzi, the causative agent of Chagas disease. The model is used to screen a proprietary library of peptidomimetic compounds originally designed to target SARS‐CoV‐2 Mpro and human cathepsin L. High‐affinity cruzain inhibitors are identified, some containing P1 moieties not previously reported in cruzain inhibitors, expanding the known chemical space for this target. Selected hits are validated using isothermal titration calorimetry and some compounds display more favorable enthalpic and entropic contributions to binding than similar peptidomimetic nitrile‐based inhibitors. Notably, this is achieved without highly lipophilic R‐groups, preserving drug‐like properties. This work also highlights how compound libraries derived from global health efforts can be effectively repurposed for neglected tropical diseases with ML models.

An ExtraTrees machine learning (ML) model is used to reposition a SARS‐CoV‐2‐directed library of peptidomimetic compounds as cruzain inhibitors for Chagas disease. Potent binders with a novel P1 moiety (for cruzain) are identified and validated by calorimetry. This work demonstrates how pandemic‐era drug libraries and ML tools can accelerate discovery for neglected diseases.© 2026 WILEY‐VCH GmbH

## Linked entities

- **Diseases:** Chagas disease (MONDO:0001444), COVID-19 (MONDO:0100096)
- **Species:** Trypanosoma cruzi (taxon 5693)

## Full-text entities

- **Genes:** Mpro [NCBI Gene 8673700], CTSL (cathepsin L) [NCBI Gene 1514] {aka CATL, CTSL1, MEP}
- **Diseases:** COVID-19 (MESH:D000086382), Chagas disease (MESH:D014355), tropical diseases (MESH:D015493)
- **Chemicals:** nitrile (MESH:D009570)
- **Species:** Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049], Trypanosoma cruzi (species) [taxon 5693], Homo sapiens (human, species) [taxon 9606]

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12860495/full.md

---
Source: https://tomesphere.com/paper/PMC12860495