Opponent State Inference Under Partial Observability: An HMM-POMDP Framework for 2026 Formula 1 Energy Strategy
Kalliopi Kleisarchaki

TL;DR
This paper introduces a two-layer inference and decision framework using an HMM and DQN to infer opponent states and optimize energy strategies in 2026 Formula 1 races under partial observability.
Contribution
It develops a novel HMM-DQN framework for opponent state inference and strategic decision-making in complex, partially observable racing scenarios, addressing a new challenge posed by 2026 regulations.
Findings
HMM achieves 96.8% ERS-level accuracy
Classifies L_harvest vs. L_derate with 89.4% accuracy
Detects counter-harvest trap with 96.3% recall
Abstract
The 2026 Formula 1 technical regulations introduce a fundamental change to energy strategy: under a 50/50 internal combustion engine / battery power split with unlimited regeneration and a driver-controlled Override Mode, the optimal energy deployment policy depends not only on a driver's own state but on the hidden state of rival cars. This creates a Partially Observable Stochastic Game that cannot be solved by single-agent optimisation methods. We present a tractable two-layer inference and decision framework. The first layer is a 40-state Hidden Markov Model (HMM) that infers a probability distribution over each rival's ERS charge level (four modes: H, M, L_harvest, L_derate), Override Mode status, and tyre degradation state from six publicly observable telemetry signals. The second layer is a Deep Q-Network (DQN) policy that takes the HMM belief state as input and selects between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsElectric and Hybrid Vehicle Technologies · Electric Vehicles and Infrastructure · Advanced Battery Technologies Research
