Learning to Cover: Online Learning and Optimization with Irreversible Decisions
Alexandre Jacquillat, Michael Lingzhi Li

TL;DR
This paper studies an online learning problem involving irreversible decisions for facility coverage, providing algorithms with optimal regret bounds and demonstrating the benefits of limited exploration in large-scale, coverage-focused decision-making.
Contribution
It introduces a novel online learning framework with irreversible decisions, derives asymptotically optimal algorithms, and establishes tight regret bounds in large-scale coverage problems.
Findings
The classifier converges to the Bayes-optimal at a rate of O(1/√n).
The regret grows sub-linearly, with bounds depending on the learning rate and problem parameters.
Limited exploration followed by exploitation improves decision-making efficiency.
Abstract
We define an online learning and optimization problem with discrete and irreversible decisions contributing toward a coverage target. In each period, a decision-maker selects facilities to open, receives information on the success of each one, and updates a classification model to guide future decisions. The goal is to minimize facility openings under a chance constraint reflecting the coverage target, in an asymptotic regime characterized by a large target number of facilities but a finite horizon . We prove that, under statistical conditions, the online classifier converges to the Bayes-optimal classifier at a rate of at best . Thus, we formulate our online learning and optimization problem, with a generalized learning rate and a residual error . We derive an asymptotically optimal algorithm and an asymptotically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning
