TL;DR
TokenCast introduces an innovative LLM-driven framework that converts time series data into symbolic tokens, enabling effective integration of numerical and textual context for improved forecasting accuracy across various real-world applications.
Contribution
The paper presents a novel symbolic discretization method and a unified semantic embedding approach using LLMs for context-aware time series forecasting.
Findings
Enhanced forecasting accuracy on diverse datasets
Effective integration of numerical and textual data
Generalizable approach across domains
Abstract
Time series forecasting plays a vital role in supporting decision-making across a wide range of critical applications, including energy, healthcare, and finance. Despite recent advances, forecasting accuracy remains limited due to the challenge of integrating historical numerical sequences with contextual features, which often comprise unstructured textual data. To address this challenge, we propose TokenCast, an LLM-driven framework that leverages language-based symbolic representations as a unified intermediary for context-aware time series forecasting. Specifically, TokenCast employs a discrete tokenizer to transform continuous numerical sequences into temporal tokens, enabling structural alignment with language-based inputs. To bridge the semantic gap between modalities, both temporal and contextual tokens are embedded into a shared representation space via a pre-trained large…
Peer Reviews
Decision·ICLR 2026 Conference Withdrawn Submission
- **Proper positioning within current research trends.** The paper is aligned with the recent movement toward symbolic or token-based time-series modeling, showing that the authors are aware of ongoing developments in the field. - **Well-organized framework.** The three-stage pipeline (tokenization, alignment, and generative prediction) is logically structured and easy to follow. - **Readable presentation.** The writing is clear, and figures effectively illustrate the workflow.
- **Lack of novelty relative to existing work.** The proposed vector quantization and tokenization strategy is highly similar to the approach used in Amazon’s Chronos model, which also discretizes numerical sequences into symbolic tokens for autoregressive forecasting. Several recent works (e.g., Chronos, Chronos-Bolt, SymbolicTS, and TokenTS) have already explored nearly identical ideas. The paper does not clearly differentiate itself in methodology or theoretical contribution, making the
1. The paper formulates context-aware forecasting as conditional sequence generation by mapping multivariate time series into discrete tokens, aligning them with text tokens in a shared LLM vocabulary, and autoregressively generating future trajectories. 2. The method includes reversible instance normalization using only historical context and a shared codebook, encoder-decoder, which keeps the tokenization invertible. 3. Experiments span six real-world domains and compare against LLM-based,
1. Dataset descriptions contain internal inconsistencies (e.g., the Economic dataset describes as daily in the main text but as monthly macroeconomic data in the appendix), which obscures the exact sampling frequency and temporal structure assumed in training and evaluation. 2. The reported MSE/MAE averages lack standard deviations, confidence intervals, or significance tests, which limits assessment of robustness when baseline performance is numerically close. 3. The paper only sketches how co
1. The paper introduces a novel LLM-driven framework, named TokenCast, for time series forecasting by leveraging LLMs to utilize unstructured contextual information. 2. The paper is clearly written and well-organized, making it easy to follow the main ideas. The methodology is technically sound and clearly explained.
1. The discussion of related work on contextual information integration could be strengthened. While many existing approaches incorporate numeric contextual signals to enhance forecasting, the integration of unstructured contextual information requires cross-modal alignment strategies. Several recent studies have explored this direction; however, this emerging line of work is not sufficiently discussed or contrasted with TokenCast. 2. In line 181, the paper states that RevIN may risk leaking fut
- The framework is well-motivated and clearly presented. - The proposed method is extensively evaluated on various real-world datasets, covering diverse domains such as healthcare, finance, and environmental monitoring. TokenCast consistently outperforms existing baselines.
- The paper claims to leverage the modeling and reasoning capabilities of LLMs, which are generally associated with larger-scale models. However, the experiments primarily rely on a relatively small LLM (Qwen2.5-0.5B). This raises questions about whether the claimed reasoning capabilities are being fully utilized and whether such a small-scale LLM can truly demonstrate the generative and reasoning power the framework aims to exploit. The choice of model size contradicts typical expectations for
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
