# Simultaneous Quantitative Analysis of Polymorphic Impurities in Canagliflozin Tablets Utilizing Near-Infrared Spectroscopy and Partial Least Squares Regression

**Authors:** Mingdi Liu, Rui Fu, Guiyu Xu, Weibing Dong, Huizhi Qi, Peiran Dong, Ping Song

PMC · DOI: 10.3390/molecules31020230 · 2026-01-09

## TL;DR

This paper introduces a fast and non-destructive method using near-infrared spectroscopy and statistical modeling to measure impurities in diabetes drug tablets.

## Contribution

A novel method combining near-infrared spectroscopy and partial least squares regression for simultaneous quantification of polymorphic impurities in canagliflozin tablets.

## Key findings

- A PLSR model achieved an R2 value of 0.9919 for quantifying An-CFZ and Mono-CFZ in CFZ tablets.
- The NIR-PLSR approach was validated for reliability and feasibility in monitoring polymorphic impurities.
- The study explains the quantitative analysis mechanism through variance contributions and correlations in latent variables.

## Abstract

Canagliflozin (CFZ), a sodium–glucose cotransporter 2 (SGLT2) inhibitor, is extensively utilized in the management of type 2 diabetes. Among its various polymorphic forms, the hemi-hydrate (Hemi-CFZ) has been selected as the active pharmaceutical ingredient (API) for CFZ tablets due to its superior solubility. However, during the production, storage, and transportation of CFZ tablets, Hemi-CFZ can undergo transformations into anhydrous (An-CFZ) and monohydrate (Mono-CFZ) forms under the influence of environmental factors such as temperature, humidity, and pressure, which may adversely impact the bioavailability and clinical efficacy of CFZ tablets. Therefore, it is imperative to develop rapid, accurate, non-destructive, and non-contact methods for quantifying An-CFZ and Mono-CFZ content in CFZ tablets to control polymorphic impurity levels and ensure product quality. This research evaluated the feasibility and reliability of using near-infrared spectroscopy (NIR) combined with partial least squares regression (PLSR) for simultaneous quantitative analysis of An-CFZ and Mono-CFZ in CFZ tablets, elucidating the quantifying mechanisms of the quantitative analysis model. Orthogonal experiments were designed to investigate the effects of different pretreatment methods and ant colony optimization (ACO) algorithms on the performance of quantitative models. An optimal PLSR model for simultaneous quantification of An-CFZ and Mono-CFZ in CFZ tablets was established and validated over a concentration range of 0.0000 to 10.0000 w/w%. The resulting model, YAn-CFZ/Mono-CFZ = 0.0207 + 0.9919 X, achieved an R2 value of 0.9919. By analyzing the relationship between the NIR spectral signals selected by the ACO algorithm and the molecular structure information of An-CFZ and Mono-CFZ, we demonstrated the feasibility and reliability of the NIR-PLSR approach for quantifying these polymorphic forms. Additionally, the mechanism of PLSR quantitative analysis was further explained through the variance contribution rates of latent variables (LVs), the correlations between LVs loadings and tablets composition, and the relationships between LV scores and An-CFZ/Mono-CFZ content. This study not only provides a robust method and theoretical foundation for monitoring An-CFZ and Mono-CFZ content in CFZ tablets throughout production, processing, storage, and transportation, but also offers a reliable methodological reference for the simultaneous quantitative analysis and quality control of multiple polymorphic impurities in other similar drugs.

## Linked entities

- **Chemicals:** Canagliflozin (PubChem CID 24812758)
- **Diseases:** type 2 diabetes (MONDO:0005148)

## Full-text entities

- **Genes:** SLC5A2 (solute carrier family 5 member 2) [NCBI Gene 6524] {aka SGLT2}
- **Diseases:** type 2 diabetes (MESH:D003924)
- **Chemicals:** API (-), CFZ (MESH:D000068896)

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12843906/full.md

---
Source: https://tomesphere.com/paper/PMC12843906