Predicting colorectal polyp recurrence using time-to-event analysis of medical records
Lia X. Harrington, Jason W. Wei, Arief A. Suriawinata, Todd A., Mackenzie, Saeed Hassanpour

TL;DR
This study uses natural language processing and survival analysis to identify key patient and polyp features that influence colorectal polyp recurrence, aiming to improve personalized surveillance strategies.
Contribution
It introduces a novel approach combining NLP and survival models to predict polyp recurrence risk based on electronic medical records.
Findings
Polyp size, number, and location significantly affect recurrence risk.
Right-sided colon polyps increase recurrence risk by 30%.
A random survival forest model achieved an AUC of 0.65.
Abstract
Identifying patient characteristics that influence the rate of colorectal polyp recurrence can provide important insights into which patients are at higher risk for recurrence. We used natural language processing to extract polyp morphological characteristics from 953 polyp-presenting patients' electronic medical records. We used subsequent colonoscopy reports to examine how the time to polyp recurrence (731 patients experienced recurrence) is influenced by these characteristics as well as anthropometric features using Kaplan-Meier curves, Cox proportional hazards modeling, and random survival forest models. We found that the rate of recurrence differed significantly by polyp size, number, and location and patient smoking status. Additionally, right-sided colon polyps increased recurrence risk by 30% compared to left-sided polyps. History of tobacco use increased polyp recurrence risk…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsColorectal Cancer Screening and Detection · Gastric Cancer Management and Outcomes · Diverticular Disease and Complications
