# Toxic chinese herbal medicine recognition in real-world images via multi-scale and attention-enhanced EfficientNetV2

**Authors:** Guohua Zhu, Jaehan Joo, Seonghyeon Park, Suk Chan Kim

PMC · DOI: 10.1371/journal.pone.0344262 · PLOS One · 2026-03-19

## TL;DR

This paper introduces a new AI model to accurately identify toxic Chinese herbal medicines in real-world images, improving public health and safety.

## Contribution

A novel EfficientNetV2-based architecture with multi-scale feature fusion and attention modules for robust toxic herb classification.

## Key findings

- The proposed model achieves 91.28% Top-1 accuracy in toxic herb classification.
- MSFF and CBAM modules significantly improve performance on challenging real-world images.
- The model outperforms existing baselines in accuracy and robustness.

## Abstract

Accurate identification of toxic Chinese herbal medicines is critical for public health and clinical safety. However, real-world herbal images often exhibit complex backgrounds and small, indistinct target regions, posing substantial challenges to automated classification systems. In this study, we present a novel image dataset comprising over 4,000 samples from 47 toxic herb categories, captured under diverse environmental conditions to reflect real-world variability. We benchmark several state-of-the-art convolutional neural networks, including ResNet, ResNeXt, and EfficientNet variants, and identify EfficientNetV2 as the most effective baseline. To further enhance model robustness and discriminative capability, we propose an improved EfficientNetV2 architecture incorporating two lightweight yet effective modules: a Multi-Scale Feature Fusion (MSFF) module to integrate hierarchical features, and a Convolutional Block Attention Module (CBAM) to refine both spatial and channel-wise representations. Experimental results demonstrate that our enhanced model achieves 91.28% Top-1 accuracy, 97.52% Top-5 accuracy, and a 90.27% macro F1-score, significantly outperforming baseline methods. Ablation studies confirm the complementary benefits of MSFF and CBAM, and targeted evaluations on challenging image subsets reveal improved resilience to background clutter and small object localization. The proposed architecture offers a high-accuracy, generalizable, and computationally efficient solution for toxic herbal medicine classification and provides a valuable reference for intelligent traditional medicine recognition applications. The GitHub repository for this project is available at: https://github.com/zhuguohua1992/Toxic-chinese-herbal-medicine-recognition-via-enhanced-EfficientNetV2.

## Full-text entities

- **Diseases:** organ failure (MESH:D009102), allergic reactions (MESH:D004342), CBAM (MESH:D001289), MIL (MESH:D007859), Toxic (MESH:D064420)
- **Chemicals:** CBAM (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13002182/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13002182/full.md

## References

27 references — full list in the complete paper: https://tomesphere.com/paper/PMC13002182/full.md

---
Source: https://tomesphere.com/paper/PMC13002182