Learning When to Act: Interval-Aware Reinforcement Learning with Predictive Temporal Structure
Davide Di Gioia

TL;DR
This paper presents a novel reinforcement learning approach that adaptively learns optimal action intervals using hyperbolic geometry and a new reward scheme, improving efficiency in continuous environments.
Contribution
It introduces a learned, interval-aware policy with hyperbolic geometry-based signals and a new reward function, advancing adaptive temporal control in reinforcement learning.
Findings
Hyperbolic spread signals improve timing decisions.
Learning the interval policy significantly boosts efficiency.
Spatial-temporal embeddings further enhance performance.
Abstract
Autonomous agents operating in continuous environments must decide not only what to do, but when to act. We introduce a lightweight adaptive temporal control system that learns the optimal interval between cognitive ticks from experience, replacing ad hoc biologically inspired timers with a principled learned policy. The policy state is augmented with a predictive hyperbolic spread signal (a "curvature signal" shorthand) derived from hyperbolic geometry: the mean pairwise Poincare distance among n sampled futures embedded in the Poincare ball. High spread indicates a branching, uncertain future and drives the agent to act sooner; low spread signals predictability and permits longer rest intervals. We further propose an interval-aware reward that explicitly penalises inefficiency relative to the chosen wait time, correcting a systematic credit-assignment failure of naive outcome-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Music Technology and Sound Studies · Neuroscience and Music Perception
