# High accuracy breast cancer classification with BIRADS and coclustering

**Authors:** Run Zhou, Xujiang Yu, Jianhao Wang

PMC · DOI: 10.1371/journal.pone.0340772 · PLOS One · 2026-02-09

## TL;DR

This paper introduces a new breast cancer classification method using BI-RADS features and coclustering, improving accuracy and handling data imbalance.

## Contribution

A novel breast cancer classification method using BI-RADS features, improved SMOTE, and coclustering for better accuracy and handling imbalanced data.

## Key findings

- The proposed method improves accuracy, precision, recall, and F1 by more than 5% compared to existing methods.
- The method maintains over 5% higher accuracy under various imbalance ratios.
- Coclustering and improved SMOTE effectively address data imbalance and enhance classification performance.

## Abstract

Breast cancer is one of the most common disease in women. Most of existing breast cancer classification methods include region segmentation, feature extraction and classification phases. It is hard for doctors to understand the conclusion drawn from low level image features. Besides, in cancer hospital more malignant cases than benign cases can be collected, in physical examination center more benign cases can be collected, causing the imbalance problem. To solve above two problems, this study designed a novel breast cancer classification method based on high level Breast Imaging Reporting and Data System (BI-RADS) features. First, an improved Synthetic Minority Oversampling Technique (SMOTE) algorithm is proposed to generate minority samples for balance. Subsequently, coclustering is adopted to mine diagnostic rules. Finally, with Adaboost, the rules can construct a strong classifier. Comparison experiment results on two public datasets shows that the accuracy, precision, recall F1 of proposed method improves more than 5% than comparison methods. Besides, under different imbalance ratios, accuracy of the proposed method is more than 5% higher than comparison methods.

## Linked entities

- **Diseases:** breast cancer (MONDO:0004989)

## Full-text entities

- **Diseases:** Breast cancer (MESH:D001943), cancer (MESH:D009369)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12885307/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12885307/full.md

## References

50 references — full list in the complete paper: https://tomesphere.com/paper/PMC12885307/full.md

---
Source: https://tomesphere.com/paper/PMC12885307