Time-RA: Towards Time Series Reasoning for Anomaly Diagnosis with LLM Feedback

Yiyuan Yang; Zichuan Liu; Lei Song; Kai Ying; Zhiguang Wang; Tom Bamford; Svitlana Vyetrenko; Jiang Bian; Qingsong Wen

arXiv:2507.15066·cs.LG·April 17, 2026

Time-RA: Towards Time Series Reasoning for Anomaly Diagnosis with LLM Feedback

Yiyuan Yang, Zichuan Liu, Lei Song, Kai Ying, Zhiguang Wang, Tom Bamford, Svitlana Vyetrenko, Jiang Bian, Qingsong Wen

PDF

1 Repo 1 Datasets

TL;DR

Time-RA introduces a reasoning-based approach to time series anomaly detection, emphasizing interpretability and multimodal data integration, supported by a large-scale benchmark and open-source code.

Contribution

It reformulates TSAD into a generative, reasoning-oriented task and provides the first multimodal benchmark with extensive annotations and open-source resources.

Findings

01

Supervised fine-tuning improves diagnostic accuracy.

02

Visual representations enhance reasoning consistency.

03

Fine-tuned models outperform traditional baselines on unseen datasets.

Abstract

Time series anomaly detection (TSAD) has traditionally focused on binary classification and often lacks the fine-grained categorization and explanatory reasoning required for transparent decision-making. To address these limitations, we propose Time-series Reasoning for Anomaly (Time-RA), a novel task that reformulates TSAD from a discriminative into a generative, reasoning-intensive paradigm. To facilitate this, we introduce RATs40K, the first real-world large-scale multimodal benchmark with ~40,000 samples across 10 domains, integrating raw time series, textual context, and visual plots with structured reasoning annotations. Extensive benchmarking shows that while supervised fine-tuning and visual representations boost diagnostic accuracy and reasoning consistency, performance varies across complex scenarios. Notably, fine-tuned models demonstrate strong "plug-and-play"…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yyysjz1997/Time-RA
github

Datasets

Time-RA/RATs40K
dataset· 121 dl
121 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.