Data Mining Techniques in Predicting Breast Cancer

Hamza Saad; Nagendra Nagarur

arXiv:2011.11088·q-bio.QM·November 24, 2020

Data Mining Techniques in Predicting Breast Cancer

Hamza Saad, Nagendra Nagarur

PDF

TL;DR

This study applies six data mining algorithms to predict breast cancer stages using clinical data, identifying tumor size as a key factor and demonstrating high prediction accuracy with decision trees.

Contribution

The paper compares multiple data mining techniques for breast cancer prediction and highlights the effectiveness of decision trees in extracting significant predictive rules.

Findings

01

Decision trees achieved the highest accuracy among algorithms.

02

Tumor size is a crucial predictor in breast cancer classification.

03

Inheritance, breast side, and menopausal status are less significant variables.

Abstract

Background and Objective: Breast cancer, which accounts for 23% of all cancers, is threatening the communities of developing countries because of poor awareness and treatment. Early diagnosis helps a lot in the treatment of the disease. The present study conducted in order to improve the prediction process and extract the main causes impacted the breast cancer. Materials and Methods: Data were collected based on eight attributes for 130 Libyan women in the clinical stages infected with this disease. Data mining was used by applying six algorithms to predict disease based on clinical stages. All the algorithms gain high accuracy, but the decision tree provides the highest accuracy-diagram of decision tree utilized to build rules from each leafnode. Ranking variables applied to extract significant variables and support final rules to predict disease. Results: All applied algorithms were…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.