Detecting ulcerative colitis from colon samples using efficient feature selection and machine learning
Hanieh Marvi Khorasani, Hamid Usefi, and Lourdes Pe\~na-Castillo

TL;DR
This study develops a machine learning model using feature selection and SVM to accurately detect ulcerative colitis from gene expression data, outperforming previous methods.
Contribution
It introduces a novel combination of DRPT feature selection with SVM for UC detection, validated on independent datasets with superior performance.
Findings
Perfect detection of active UC cases.
Average precision of 0.62 for inactive cases.
Outperforms previous models and software in accuracy.
Abstract
Ulcerative colitis (UC) is one of the most common forms of inflammatory bowel disease (IBD) characterized by inflammation of the mucosal layer of the colon. Diagnosis of UC is based on clinical symptoms, and then confirmed based on endoscopic, histologic and laboratory findings. Feature selection and machine learning have been previously used for creating models to facilitate the diagnosis of certain diseases. In this work, we used a recently developed feature selection algorithm (DRPT) combined with a support vector machine (SVM) classifier to generate a model to discriminate between healthy subjects and subjects with UC based on the expression values of 32 genes in colon samples. We validated our model with an independent gene expression dataset of colonic samples from subjects in active and inactive periods of UC. Our model perfectly detected all active cases and had an average…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFeature Selection
