How Does Data Freshness Affect Real-time Supervised Learning?
Md Kamran Chowdhury Shisher, Yin Sun

TL;DR
This paper investigates how data freshness impacts real-time supervised learning performance, revealing non-monotonic effects and proposing new scheduling strategies to optimize inference accuracy based on Age of Information.
Contribution
It introduces a novel analysis of data freshness effects, including non-monotonic behaviors, and develops low-complexity scheduling algorithms using Gittins and Whittle indices for improved real-time inference.
Findings
Prediction error can be non-monotonic with data age.
Proposed scheduling strategies outperform existing methods.
New connection established between Gittins index theory and AoI minimization.
Abstract
In this paper, we analyze the impact of data freshness on real-time supervised learning, where a neural network is trained to infer a time-varying target (e.g., the position of the vehicle in front) based on features (e.g., video frames) observed at a sensing node (e.g., camera or lidar). One might expect that the performance of real-time supervised learning degrades monotonically as the feature becomes stale. Using an information-theoretic analysis, we show that this is true if the feature and target data sequence can be closely approximated as a Markov chain; it is not true if the data sequence is far from Markovian. Hence, the prediction error of real-time supervised learning is a function of the Age of Information (AoI), where the function could be non-monotonic. Several experiments are conducted to illustrate the monotonic and non-monotonic behaviors of the prediction error. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAge of Information Optimization
