Learning Tennis Strategy Through Curriculum-Based Dueling Double Deep Q-Networks
Vishnu Mohan

TL;DR
This paper introduces a reinforcement learning framework using curriculum-based dueling double deep Q-networks to optimize tennis strategies, achieving high win rates and demonstrating the importance of curriculum and architecture choices.
Contribution
It presents a novel tennis simulation environment combined with a curriculum learning approach and a dueling DDQN architecture for effective strategy learning.
Findings
Achieved 98-100% win rates against balanced opponents.
Demonstrated the necessity of curriculum learning and dueling architecture for stability.
Identified a defensive bias in learned strategies, emphasizing error avoidance.
Abstract
Tennis strategy optimization is a challenging sequential decision-making problem involving hierarchical scoring, stochastic outcomes, long-horizon credit assignment, physical fatigue, and adaptation to opponent skill. I present a reinforcement learning framework that integrates a custom tennis simulation environment with a Dueling Double Deep Q-Network(DDQN) trained using curriculum learning. The environment models complete tennis scoring at the level of points, games, and sets, rally-level tactical decisions across ten discrete action categories, symmetric fatigue dynamics, and a continuous opponent skill parameter. The dueling architecture decomposes action-value estimation into state-value and advantage components, while double Q-learning reduces overestimation bias and improves training stability in this long-horizon stochastic domain. Curriculum learning progressively increases…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSports Analytics and Performance · Reinforcement Learning in Robotics · Sports Performance and Training
