Breast Cancer Data Analytics With Missing Values: A study on Ethnic, Age and Income Groups
Santosh Tirunagari, Norman Poh, Hajara Abdulrahman, Nawal Nemmour and, David Windridge

TL;DR
This study analyzes breast cancer survival rates across different ethnic, income, and age groups in South East England, addressing missing data and revealing disparities in survival outcomes.
Contribution
The paper reports on survival rate analysis and introduces methods for predicting missing ethnicity data in a large breast cancer dataset.
Findings
White women have higher survival rates than Black women.
Higher income groups show better survival outcomes.
Older age groups (80-95) have lower survival rates.
Abstract
An analysis of breast cancer incidences in women and the relationship between ethnicity and survival rate has been an ongoing study with recorded incidences of missing values in the secondary data. In this paper, we study and report the results of breast cancer survival rate by ethnicity, age and income groups from the dataset collected for 53593 patients in South East England between the years 1998 and 2003. In addition to this, we also predict the missing values for the ethnic groups in the dataset. The principle findings in our study suggest that: 1) women of white ethnicity in South East England have a highest percentage of survival rate when compared to the black ethnicity, 2) High income groups have higher survival rates to that of lower income groups and 3) Age groups between 80-95 have lower percentage of survival rate.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGlobal Cancer Incidence and Screening · Colorectal Cancer Screening and Detection · Statistical Methods and Inference
