Feature Selection on Lyme Disease Patient Survey Data
Joshua Vendrow, Jamie Haddock, Deanna Needell, and Lorraine Johnson

TL;DR
This study applies various machine learning techniques to Lyme disease patient survey data to identify key features influencing treatment response and disease progression, aiming to improve understanding and diagnosis.
Contribution
It introduces a comprehensive machine learning approach to feature selection in Lyme disease patient data, highlighting important predictors of treatment outcomes.
Findings
Key features identified for predicting patient responses
Machine learning models effectively distinguish important clinical factors
Potential directions for future mathematical and clinical research
Abstract
Lyme disease is a rapidly growing illness that remains poorly understood within the medical community. Critical questions about when and why patients respond to treatment or stay ill, what kinds of treatments are effective, and even how to properly diagnose the disease remain largely unanswered. We investigate these questions by applying machine learning techniques to a large scale Lyme disease patient registry, MyLymeData, developed by the nonprofit LymeDisease.org. We apply various machine learning methods in order to measure the effect of individual features in predicting participants' answers to the Global Rating of Change (GROC) survey questions that assess the self-reported degree to which their condition improved, worsened, or remained unchanged following antibiotic treatment. We use basic linear regression, support vector machines, neural networks, entropy-based decision tree…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods in Epidemiology · Genetic and phenotypic traits in livestock
