Sample-Efficient Reinforcement Learning Controller for Deep Brain Stimulation in Parkinson's Disease
Harsh Ravivarapu, Gaurav Bagwe, Xiaoyong Yuan, Chunxiu Yu, Lan Zhang

TL;DR
This paper introduces SEA-DBS, a novel reinforcement learning framework that enhances adaptive deep brain stimulation for Parkinson's disease by improving sample efficiency, stability, and hardware compatibility.
Contribution
SEA-DBS combines a predictive reward model and Gumbel Softmax exploration to address RL challenges in neurostimulation, enabling faster and more robust adaptive control.
Findings
Faster convergence in simulation
Stronger suppression of beta-band oscillations
Resilience to FP16 quantization
Abstract
Deep brain stimulation (DBS) is an established intervention for Parkinson's disease (PD), but conventional open-loop systems lack adaptability, are energy-inefficient due to continuous stimulation, and provide limited personalization to individual neural dynamics. Adaptive DBS (aDBS) offers a closed-loop alternative, using biomarkers such as beta-band oscillations to dynamically modulate stimulation. While reinforcement learning (RL) holds promise for personalized aDBS control, existing methods suffer from high sample complexity, unstable exploration in binary action spaces, and limited deployability on resource-constrained hardware. We propose SEA-DBS, a sample-efficient actor-critic framework that addresses the core challenges of RL-based adaptive neurostimulation. SEA-DBS integrates a predictive reward model to reduce reliance on real-time feedback and employs Gumbel Softmax-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeurological disorders and treatments · EEG and Brain-Computer Interfaces · Transcranial Magnetic Stimulation Studies
