MM-ISTS: Cooperating Irregularly Sampled Time Series Forecasting with Multimodal Vision-Text LLMs
Zhi Lei, Chenxi Liu, Hao Miao, Wanghui Qiu, Bin Yang, Chenjuan Guo

TL;DR
This paper introduces MM-ISTS, a multimodal framework using vision-text large language models to improve irregularly sampled time series forecasting by capturing complex temporal and contextual information across multiple modalities.
Contribution
The paper proposes a novel two-stage encoding mechanism with cross-modal vision-text encoding, enriched temporal features, and an adaptive query-based extractor for efficient, multimodal ISTS forecasting.
Findings
Effective in capturing intricate temporal patterns
Reduces computational costs with adaptive feature extraction
Improves forecasting accuracy on real-world data
Abstract
Irregularly sampled time series (ISTS) are widespread in real-world scenarios, exhibiting asynchronous observations on uneven time intervals across variables. Existing ISTS forecasting methods often solely utilize historical observations to predict future ones while falling short in learning contextual semantics and fine-grained temporal patterns. To address these problems, we achieve MM-ISTS, a multimodal framework augmented by vision-text large language models, that bridges temporal, visual, and textual modalities, facilitating ISTS forecasting. MM-ISTS encompasses a novel two-stage encoding mechanism. In particular, a cross-modal vision-text encoding module is proposed to automatically generate informative visual images and textual data, enabling the capture of intricate temporal patterns and comprehensive contextual understanding, in collaboration with multimodal LLMs (MLLMs). In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Multimodal Machine Learning Applications · Machine Learning in Healthcare
