Coverage Analysis for Digital Cousin Selection -- Improving Multi-Environment Q-Learning
Talha Bozkus, Tara Javidi, and Urbashi Mitra

TL;DR
This paper provides a probabilistic coverage analysis for multi-environment mixed Q-learning algorithms, introducing a new coverage coefficient-based method that enhances accuracy, reduces complexity, and outperforms existing algorithms in large network optimization tasks.
Contribution
The paper develops a probabilistic coverage analysis framework for MEMQ algorithms, introduces a novel CC-based MEMQ algorithm, and demonstrates significant improvements in accuracy and efficiency over prior methods.
Findings
Reduces average policy error by 65% compared to partial ordering.
Achieves 95% faster computation than exhaustive search.
Outperforms state-of-the-art algorithms with 60% less APE.
Abstract
Q-learning is widely employed for optimizing various large-dimensional networks with unknown system dynamics. Recent advancements include multi-environment mixed Q-learning (MEMQ) algorithms, which utilize multiple independent Q-learning algorithms across multiple, structurally related but distinct environments and outperform several state-of-the-art Q-learning algorithms in terms of accuracy, complexity, and robustness. We herein conduct a comprehensive probabilistic coverage analysis to ensure optimal data coverage conditions for MEMQ algorithms. First, we derive upper and lower bounds on the expectation and variance of different coverage coefficients (CC) for MEMQ algorithms. Leveraging these bounds, we develop a simple way of comparing the utilities of multiple environments in MEMQ algorithms. This approach appears to be near optimal versus our previously proposed partial ordering…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and ELM
MethodsQ-Learning
