Bi-Level Online Provisioning and Scheduling with Switching Costs and Cross-Level Constraints

Jialei Liu; C. Emre Koksal; Ming Shi

arXiv:2601.18936·cs.LG·February 24, 2026

Bi-Level Online Provisioning and Scheduling with Switching Costs and Cross-Level Constraints

Jialei Liu, C. Emre Koksal, Ming Shi

PDF

Open Access

TL;DR

This paper introduces a novel bi-level online learning framework combining convex optimization and constrained Markov decision processes to optimize network resource provisioning and scheduling with dynamic thresholds and costs.

Contribution

It develops a new algorithm addressing bi-level online learning with switching costs and cross-level constraints, bridging the gap between OCO and CMDP models.

Findings

01

Achieves near-optimal regret bounds.

02

Ensures high-probability satisfaction of cross-level constraints.

03

Handles dynamic thresholds in provisioning and scheduling.

Abstract

We study a bi-level online provisioning and scheduling problem motivated by network resource allocation, where provisioning decisions are made at a slow time scale while queue-/state-dependent scheduling is performed at a fast time scale. We model this two-time-scale interaction using an upper-level online convex optimization (OCO) problem and a lower-level constrained Markov decision process (CMDP). Existing OCO typically assumes stateless decisions and thus cannot capture MDP network dynamics such as queue evolution. Meanwhile, CMDP algorithms typically assume a fixed constraint threshold, whereas in provisioning-and-scheduling systems, the threshold varies with online budget decisions. To address these gaps, we study bi-level OCO-CMDP learning under switching costs (budget reprovisioning/system reconfiguration) and cross-level constraints that couple budgets to scheduling decisions.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAge of Information Optimization · Advanced Queuing Theory Analysis · Reinforcement Learning in Robotics