# Machine Learning-Guided Development of Anti-Tuberculosis Dry Powder for Inhalation Prepared by Co-Spray Drying

**Authors:** Xiaoyun Hu, Xian Chen, Ziling Zhou, Aichao Wang, Xin Pan, Chuanbin Wu, Junhuang Jiang

PMC · DOI: 10.3390/pharmaceutics18020191 · 2026-02-01

## TL;DR

This study uses machine learning to develop better dry powder inhalers for tuberculosis drugs, improving lung delivery and reducing side effects.

## Contribution

The novel use of machine learning to optimize anti-TB dry powder formulations for inhalation is introduced.

## Key findings

- Rifampin-L-lysine acetate and pyrazinamide-L-leucine formulations achieved high fine particle fractions and optimal MMAD.
- XGBoost demonstrated strong predictive performance with R2 values up to 0.991 for DPI formulation predictions.
- Molecular weights and LogP of drug-amino acid combinations were identified as key features influencing aerodynamic performance.

## Abstract

Background/Objectives: Tuberculosis (TB) remains a major global health threat. Current administration methods for anti-TB drugs, including oral or intravenous, suffer from systemic side effects, low lung distribution, and poor patient compliance. Dry powder inhalers (DPIs) offer a promising alternative. This study investigates the aerodynamic performance of co-spray-dried DPIs containing rifampin or pyrazinamide and amino acids by using machine learning. Methods: Firstly, 72 formulations were prepared by varying drug-amino acid combinations, molar ratios, and spray-drying parameters. Subsequently, the aerodynamic performance of all 72 formulations was evaluated using a Next Generation Impactor, and the solid-state characterizations of optimal DPIs were carried out. Finally, four machine learning (ML) models were successfully developed and were utilized to predict the fine particle dose (FPD), FPF, MMAD, and geometric standard deviation (GSD) of DPIs based on the high-quality in-house data above. Results: Key results showed that the aerodynamic performance of DPIs was highly dependent on the specific drug-amino acid combination, with rifampin-L-lysine acetate and pyrazinamide-L-leucine formulations achieving the highest fine particle fraction (FPF, 73.37%, 87.74%) and optimal mass median aerodynamic diameter (MMAD, 2.59 µm, 1.88 µm). Notably, XGBoost (v3.1.3) exhibited the best predictive performance, with R2 values ranging from 0.894 to 0.991 in the testing set for the four prediction tasks. Meanwhile, SHapley Additive exPlanations (v0.50.0) was used for model interpretability analysis. The molecular weights and LogP of the drug and amino acid were identified as two of the most important features affecting the prediction of FPD, FPF, MMAD, and GSD. Conclusions: This work demonstrates the feasibility of ML in accelerating the development of inhalable spray-dried anti-TB drugs by enabling the prediction of DPI formulations.

## Linked entities

- **Chemicals:** rifampin (PubChem CID 135398735), pyrazinamide (PubChem CID 1046), L-lysine acetate (PubChem CID 104152), L-leucine (PubChem CID 857)
- **Diseases:** Tuberculosis (MONDO:0018076)

## Full-text entities

- **Genes:** SHROOM4 (shroom family member 4) [NCBI Gene 57477] {aka MRXSSDS, SHAP, shrm4}, RHOF (ras homolog family member F, filopodia associated) [NCBI Gene 54509] {aka ARHF, RIF}
- **Diseases:** ML (MESH:D007859), DPIs (MESH:D015208), injury to (MESH:D014947), GSD (MESH:D010262), TB (MESH:D014376), MMAD (MESH:D020423), death (MESH:D003643), FPD (MESH:D014202), infected (MESH:D007239), pulmonary TB (MESH:D014397), gastrointestinal disturbances (MESH:D005767)
- **Chemicals:** water (MESH:D014867), L-leucin (MESH:D007930), ethanol (MESH:D000431), acetic acid (MESH:D019342), PYR (MESH:D011718), acid (MESH:D000143), isoniazid (MESH:D007538), gold (MESH:D006046), methanol (MESH:D000432), HPMC (MESH:D065347), acetonitrile (MESH:C032159), carbon (MESH:D002244), Potassium dihydrogen phosphate (MESH:C013216), nitrogen (MESH:D009584), citric acid (MESH:D019343), Tween 20 (MESH:D011136), L-Lysine (MESH:D008239), Hydrogen (MESH:D006859), Breezhaler (-), trileucine (MESH:C036671), Rifampicin (MESH:D012293), Sodium thiosulfate (MESH:C017717), L-arginine (MESH:D001120), amino acid (MESH:D000596), ethambutol (MESH:D004977)
- **Species:** Mycobacterium tuberculosis complex (species group) [taxon 77643], Legionella sp. L (species) [taxon 74303], Homo sapiens (human, species) [taxon 9606], Mycobacterium tuberculosis (species) [taxon 1773]

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12943867/full.md

---
Source: https://tomesphere.com/paper/PMC12943867