Concentration Tail-Bound Analysis of Coevolutionary and Bandit Learning Algorithms
Per Kristian Lehre, Shishen Lin

TL;DR
This paper introduces a new drift theorem that provides exponential tail-bounds for the runtime and regret of algorithms, enabling high-probability guarantees and revealing insights into algorithm reliability and weaknesses.
Contribution
It presents a novel drift theorem for deriving concentration tail-bounds applicable to algorithms with various drift conditions, including negative drift, and demonstrates its use in analyzing bandit and co-evolutionary algorithms.
Findings
The regret of the wab bandit algorithm is highly concentrated.
The time for RLS-PD to find a Nash equilibrium is highly concentrated.
RLS-PD forgets the Nash equilibrium after a concentrated time.
Abstract
Runtime analysis, as a branch of the theory of AI, studies how the number of iterations algorithms take before finding a solution (its runtime) depends on the design of the algorithm and the problem structure. Drift analysis is a state-of-the-art tool for estimating the runtime of randomised algorithms, such as evolutionary and bandit algorithms. Drift refers roughly to the expected progress towards the optimum per iteration. This paper considers the problem of deriving concentration tail-bounds on the runtime/regret of algorithms. It provides a novel drift theorem that gives precise exponential tail-bounds given positive, weak, zero and even negative drift. Previously, such exponential tail bounds were missing in the case of weak, zero, or negative drift. Our drift theorem can be used to prove a strong concentration of the runtime/regret of algorithms in AI. For example, we prove that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Advanced Bandit Algorithms Research · Reinforcement Learning in Robotics
