TOTEM: TOkenized Time Series EMbeddings for General Time Series Analysis
Sabera Talukder, Yisong Yue, Georgia Gkioxari

TL;DR
This paper introduces TOTEM, a tokenized embedding approach for general time series analysis that performs well across multiple tasks and datasets with minimal fine-tuning, inspired by large language models.
Contribution
The paper proposes TOTEM, a novel tokenization-based method for creating generalist time series models that excel in zero-shot performance across diverse tasks and datasets.
Findings
TOTEM matches or outperforms state-of-the-art models in multiple tasks.
It demonstrates strong zero-shot performance with minimal fine-tuning.
Extensive experiments validate the effectiveness of tokenization for general time series analysis.
Abstract
This work studies the problem of time series analysis with generalist (or foundation) models, which are models trained across many data domains. Drawing inspiration from the widespread success of large language models, we consider the simple strategy of discretely tokenizing time series data drawn from a myriad of datasets via self-supervision, then using the fixed tokenization to solve a variety of tasks across many data domains. Canonically, time series models are either trained on a single dataset or built in a task-specific manner (e.g., a forecasting-only model), where many use patches of time as inputs to the model. As such, performant generalist, discrete representation time series models explored across many tasks are of value. Our method, TOkenized Time Series EMbeddings (TOTEM), produces such generalist time series models with minimal or no fine-tuning while exhibiting strong…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting
