Investigate the Correlation of Breast Cancer Dataset using Different   Clustering Technique

Somenath Chakraborty; Beddhu Murali

arXiv:2109.01538·cs.LG·September 6, 2021·1 cites

Investigate the Correlation of Breast Cancer Dataset using Different Clustering Technique

Somenath Chakraborty, Beddhu Murali

PDF

Open Access

TL;DR

This paper explores various clustering techniques and preprocessing methods to analyze breast cancer data in an unsupervised manner, aiming to enhance medical prognosis systems by identifying data correlations.

Contribution

It compares multiple clustering methods and preprocessing steps to determine the most effective approach for analyzing breast cancer datasets without prior training.

Findings

01

K-means and PAM clustering techniques reveal significant data correlations.

02

Preprocessing improves clustering accuracy and robustness.

03

The study provides insights for designing better prognosis systems.

Abstract

The objectives of this paper are to explore ways to analyze breast cancer dataset in the context of unsupervised learning without prior training model. The paper investigates different ways of clustering techniques as well as preprocessing. This in-depth analysis builds the footprint which can further use for designing a most robust and accurate medical prognosis system. This paper also give emphasis on correlations of data points with different standard benchmark techniques. Keywords: Breast cancer dataset, Clustering Technique Hopkins Statistic, K-means Clustering, k-medoids or partitioning around medoids (PAM)

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI in cancer detection