Chain-of-thought Reviewing and Correction for Time Series Question Answering
Chen Su, Yuanhe Tian, Yan Song

TL;DR
This paper introduces T3LLM, a multi-model framework for time series question answering that improves reasoning accuracy through explicit correction and verification, achieving state-of-the-art results.
Contribution
The paper proposes T3LLM, a novel multi-LLM approach with correction mechanisms for time series QA, enhancing reasoning accuracy over existing methods.
Findings
T3LLM outperforms strong LLM baselines on multiple TSQA benchmarks.
Explicit correction improves reasoning accuracy in time series analysis.
The framework effectively internalizes multi-step reasoning and self-correction.
Abstract
With the advancement of large language models (LLMs), diverse time series analysis tasks are reformulated as time series question answering (TSQA) through a unified natural language interface. However, existing LLM-based approaches largely adopt general natural language processing techniques and are prone to reasoning errors when handling complex numerical sequences. Different from purely textual tasks, time series data are inherently verifiable, enabling consistency checking between reasoning steps and the original input. Motivated by this property, we propose T3LLM, which performs multi-step reasoning with an explicit correction mechanism for time series question answering. The T3LLM framework consists of three LLMs, namely, a worker, a reviewer, and a student, that are responsible for generation, review, and reasoning learning, respectively. Within this framework, the worker…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Time Series Analysis and Forecasting · Machine Learning in Healthcare
