Machine Learning Techniques Used for the Identification of Sociodemographic Factors Associated With Cancer: Systematic Literature Review
Liz González-Infante, Gaston Marquez, Solange Parra-Soto, Mónica Cardona-Valencia, Carla Taramasco

TL;DR
This paper reviews how machine learning is used to study the link between sociodemographic factors and cancer outcomes, highlighting gaps and opportunities for more equitable cancer care.
Contribution
The paper systematically reviews ML applications in identifying sociodemographic factors linked to cancer outcomes, emphasizing methodological trends and limitations.
Findings
Most studies used supervised ML techniques like random forest and extreme gradient boosting.
Common sociodemographic variables included age, gender, education, income, and geographic location.
External validation and integration of clinical data remain limited in current research.
Abstract
Cancer remains one of the foremost global causes of mortality, with nearly 10 million deaths recorded by 2020. As incidence rates rise, there is a growing interest in leveraging machine learning (ML) to enhance prediction, diagnosis, and treatment strategies. Despite these advancements, insufficient attention has been directed toward the integration of sociodemographic variables, which are crucial determinants of health equity, into ML models in oncology. This review aims to investigate how ML techniques have been used to identify patterns of predictive association between sociodemographic factors and cancer-related outcomes. Specifically, it seeks to map current research endeavors by detailing the types of algorithms used, the sociodemographic variables examined, and the validation methodologies used. We conducted a systematic literature review in accordance with the PRISMA…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Global Cancer Incidence and Screening · Artificial Intelligence in Healthcare and Education
