Time Series Language Model for Descriptive Caption Generation
Mohamed Trabelsi, Aidan Boyd, Jin Cao, Huseyin Uzunalioglu

TL;DR
This paper introduces TSLM, a novel encoder-decoder language model designed for generating descriptive captions of time series data, addressing data scarcity and capturing subtle temporal patterns for improved interpretability.
Contribution
TSLM is the first specialized time series captioning model leveraging in-context synthetic data generation and cross-modal dense retrieval to enhance caption quality.
Findings
TSLM outperforms existing methods on multiple datasets.
Synthetic data generation improves model training.
Cross-modal dense retrieval enhances caption accuracy.
Abstract
The automatic generation of representative natural language descriptions for observable patterns in time series data enhances interpretability, simplifies analysis and increases cross-domain utility of temporal data. While pre-trained foundation models have made considerable progress in natural language processing (NLP) and computer vision (CV), their application to time series analysis has been hindered by data scarcity. Although several large language model (LLM)-based methods have been proposed for time series forecasting, time series captioning is under-explored in the context of LLMs. In this paper, we introduce TSLM, a novel time series language model designed specifically for time series captioning. TSLM operates as an encoder-decoder model, leveraging both text prompts and time series data representations to capture subtle temporal patterns across multiple phases and generate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Natural Language Processing Techniques · Multimodal Machine Learning Applications
