A Theoretical Analysis of Detecting Large Model-Generated Time Series
Junji Hou, Junzhou Zhao, Shuo Zhang, Pinghui Wang

TL;DR
This paper introduces a theoretical framework and a new detection method for identifying large model-generated time series by leveraging the uncertainty contraction phenomenon under recursive forecasting.
Contribution
It proposes the contraction hypothesis and the Uncertainty Contraction Estimator (UCE), a novel white-box detector that outperforms existing methods in identifying synthetic time series.
Findings
UCE outperforms state-of-the-art baselines across 32 datasets.
Model-generated time series show decreasing uncertainty under recursive forecasting.
The contraction hypothesis is validated both theoretically and empirically.
Abstract
Motivated by the increasing risks of data misuse and fabrication, we investigate the problem of identifying synthetic time series generated by Time-Series Large Models (TSLMs) in this work. While there are extensive researches on detecting model generated text, we find that these existing methods are not applicable to time series data due to the fundamental modality difference, as time series usually have lower information density and smoother probability distributions than text data, which limit the discriminative power of token-based detectors. To address this issue, we examine the subtle distributional differences between real and model-generated time series and propose the contraction hypothesis, which states that model-generated time series, unlike real ones, exhibit progressively decreasing uncertainty under recursive forecasting. We formally prove this hypothesis under…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTime Series Analysis and Forecasting · Stock Market Forecasting Methods · Machine Learning in Healthcare
