# Harnessing machine learning and multivariate analysis to explore global trends in Cannabis sativa research

**Authors:** Javier De La Hoz-M, Karime Montes-Escobar, Carlos Alfredo Salas-Macias

PMC · DOI: 10.1186/s42238-026-00397-w · 2026-02-21

## TL;DR

This study uses data science to analyze global Cannabis sativa research trends from 1974 to 2024, revealing geographic and thematic patterns.

## Contribution

A novel framework combining LDA and HJ-Biplot methods for analyzing and visualizing research trends in Cannabis sativa.

## Key findings

- Medical applications of Cannabis sativa are increasingly studied in North America and Europe.
- Research gaps persist in Africa and South America.
- The combined method offers a replicable approach for managing large bibliographic datasets.

## Abstract

This study employs advanced data science techniques to explore global research trends in Cannabis sativa from 1974 to 2024. This research integrated bibliographic datasets from PubMed, Scopus, and Web of Science. By combining latent Dirichlet allocation (LDA) and HJ-Biplot methods, we extracted actionable insights from large-scale data to address the current gap in long-term global research monitoring. The analysis identified key research topics, geographic disparities, and temporal trends, providing a comprehensive overview of the evolution of Cannabis sativa studies. The results highlight an increasing focus on the medical applications of Cannabis sativa, particularly in North America and Europe, while highlighting research gaps in emerging regions such as Africa and South America. Furthermore, the integration of multivariate methods with machine learning offers a replicable framework for managing large bibliographic datasets and enhancing data-driven decision-making in research management. Additionally, combining topic modeling with multivariate visualization provides a novel framework to understand how research themes evolve and interact. This approach serves as a strategic tool for stakeholders navigating the rapidly changing cannabis field.

## Linked entities

- **Species:** Cannabis sativa (taxon 3483)

## Full-text entities

- **Species:** Cannabis sativa (species) [taxon 3483]

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13032642/full.md

---
Source: https://tomesphere.com/paper/PMC13032642