# Detection and classification of venous thromboembolism through image test reports analysis using active learning and deep learning

**Authors:** Eunseon Jeong, Youngje Woo, Hong Joo Lee, Jang Yong Kim

PMC · DOI: 10.1371/journal.pone.0335262 · PLOS One · 2025-11-10

## TL;DR

This study shows that deep learning models can accurately classify venous thromboembolism reports with high efficiency and robustness, even with limited data.

## Contribution

DL models with active learning outperformed traditional ML in VTE classification, especially under class imbalance and with minimal labeled data.

## Key findings

- DL models achieved F1 scores ≥0.94 in VTE classification despite class imbalance.
- DL models had inference times suitable for real-time clinical use.
- Excluding a class reduced performance, highlighting the importance of class diversity.

## Abstract

This study evaluated whether active learning can enable efficient classification of venous thromboembolism (VTE) reports using minimal labeled data. In parallel, we assessed whether deep learning (DL) models can achieve substantially superior performance compared to traditional machine learning (ML) models and overcome the limitations associated with small sample sizes and class imbalance in real-world clinical datasets.

5,839 imaging reports with, of which 1,088 (18.6%) were VTE-positive. Traditional ML models (RF, SVM, SVM with SGD, GBM) were combined with active learning strategies (random sampling, uncertainty-based, word similarity, TF-IDF similarity). DL models (LSTM, multi-kernel 1D-CNN with GloVe, BERT-based models) were also evaluated. F1 scores were used as the performance metric.

Among VTE-positive patients, 65.0% had corresponding ICD-10 codes, indicating frequent under-documentation. ML models with active learning achieved F1 scores of 0.70–0.80, while DL models, particularly LSTM and multi-kernel 1D-CNN with GloVe achieved F1 scores ≥0.94 in a 7-class classification, even under severe class imbalance. Excluding the “No DVT and PE” class for a 6-class classification among VTE-positive cases led to reduced model performance, with the largest decline observed in BioBERT. The average inference time per report ranged from 0.0014 to 0.024 seconds depending on the model architecture, suggesting that the system is feasible for near real-time deployment in clinical settings.

DL models substantially outperformed traditional ML in classifying VTE reports, with high accuracy, acceptable inference time, and robustness to class imbalance. These models hold promise for augmenting clinical workflows, particularly in addressing under-coded but clinically significant VTE cases.

## Linked entities

- **Diseases:** venous thromboembolism (MONDO:0005399)

## Full-text entities

- **Diseases:** DVT (OMIM:612862), VTE (MESH:D054556)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12599933/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12599933/full.md

## References

26 references — full list in the complete paper: https://tomesphere.com/paper/PMC12599933/full.md

---
Source: https://tomesphere.com/paper/PMC12599933