A neural network model for timing control with reinforcement
Jing Wang, Yousuf El-Jayyousi, Ilker Ozden

TL;DR
This paper introduces a neural network model inspired by brain mechanisms that learns timing control through reinforcement, capturing both long-term memory effects and short-term variability adjustments in human-like timing behavior.
Contribution
The model combines recurrent neural networks with reward-dependent variability, providing a mechanistic and Bayesian framework that better mimics human timing learning than previous models.
Findings
Successfully replicates key features of human timing behavior
Estimates outcome uncertainty to distinguish variability sources
Enhances reinforcement learning for continuous control tasks
Abstract
How do humans and animals perform trial-and-error learning when the space of possibilities is infinite? In a previous study, we used an interval timing production task and discovered an updating strategy in which the agent adjusted the behavioral and neuronal noise for exploration. In the experiment, human subjects proactively generated a series of timed motor outputs. We found that the sequential motor timing varied at two temporal scales: long-term correlation around the target interval due to memory drifts and short-term adjustments of timing variability according to feedback. We have previously described these features of timing variability with an augmented Gaussian process, termed reward sensitive Gaussian process (RSGP). Here we provide a mechanistic model and simulate the process by borrowing the architecture of recurrent neural networks. While recurrent connection provided the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural dynamics and brain function · Neural Networks and Applications · Control Systems and Identification
MethodsGaussian Process
