Decision Support under Prediction-Induced Censoring

Yan Chen; Ruyi Huang; Cheng Liu

arXiv:2602.18031·eess.SY·February 23, 2026

Decision Support under Prediction-Induced Censoring

Yan Chen, Ruyi Huang, Cheng Liu

PDF

Open Access

TL;DR

This paper introduces PIC-RL, an adaptive reinforcement learning framework that addresses prediction-induced censoring in large-scale resource allocation, improving demand estimation and decision-making in generative AI serving systems.

Contribution

The paper proposes a novel closed-loop reinforcement learning approach that transforms censoring issues into decision signals, with theoretical guarantees and practical validation on Alibaba GenAI data.

Findings

01

PIC-RL reduces service degradation by up to 50%

02

It outperforms state-of-the-art baselines in demand prediction and resource allocation

03

The framework maintains cost efficiency while improving system robustness

Abstract

In many data-driven online decision systems, actions determine not only operational costs but also the data availability for future learning -- a phenomenon termed Prediction-Induced Censoring (PIC). This challenge is particularly acute in large-scale resource allocation for generative AI (GenAI) serving: insufficient capacity triggers shortages but hides the true demand, leaving the system with only a "greater-than" constraint. Standard decision-making approaches that rely on uncensored data suffer from selection bias, often locking the system into a self-reinforcing low-provisioning trap. To break this loop, this paper proposes an adaptive approach named PIC-Reinforcement Learning (PIC-RL), a closed-loop framework that transforms censoring from a data quality problem into a decision signal. PIC-RL integrates (1) Uncertainty-Aware Demand Prediction to manage the information-cost…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Stream Mining Techniques · Advanced Bandit Algorithms Research · Age of Information Optimization