Discounting and Drug Seeking in Biological Hierarchical Reinforcement Learning
Vardhan Palod, Pranav Mahajan, Veeky Baths, Boris S. Gutkin

TL;DR
This paper models addiction within a hierarchical reinforcement learning framework, showing how increased temporal discounting amplifies drug-seeking behavior and aligns with empirical findings, offering new insights into addiction as a hierarchical decision-making disorder.
Contribution
It introduces a biologically grounded HRL model that integrates discounting effects, explaining the neural and behavioral mechanisms of addiction in a hierarchical context.
Findings
High discounting increases drug-seeking across the hierarchy.
Model aligns with empirical data on temporal discounting in addiction.
Predicts addiction as a disorder of hierarchical decision-making.
Abstract
Despite a strong desire to quit, individuals with long-term substance use disorder (SUD) often struggle to resist drug use, even when aware of its harmful consequences. This disconnect between knowledge and compulsive behavior reflects a fundamental cognitive-behavioral conflict in addiction. Neurobiologically, differential cue-induced activity within striatal subregions, along with dopamine-mediated connectivity from the ventral to the dorsal striatum, contributes to compulsive drug-seeking. However, the functional mechanism linking these findings to behavioral conflict remains unclear. Another hallmark of addiction is temporal discounting: individuals with drug dependence exhibit steeper discount rates than non-users. Assuming the ventral-dorsal striatal organization reflects a gradient from cognitive to motor representations, addiction can be modeled within a hierarchical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural and Behavioral Psychology Studies · Neurotransmitter Receptor Influence on Behavior · Decision-Making and Behavioral Economics
MethodsAttentive Walk-Aggregating Graph Neural Network
