TL;DR
PyPOTS is an open-source Python toolkit enabling end-to-end data mining and machine learning on partially-observed time series, covering tasks from imputation to anomaly detection.
Contribution
It introduces a comprehensive, unified framework for handling POTS with practical workflows and extensibility for research and production use.
Findings
Provides practical workflows for POTS tasks
Enables end-to-end pipeline construction
Open-source at https://github.com/WenjieDu/PyPOTS
Abstract
Partially-observed time series (POTS) is ubiquitous in real-world applications, yet most existing toolchains separate missing-value handling from downstream learning, which limits reproducibility and overall performance. This tutorial introduces PyPOTS, an open-source Python ecosystem for end-to-end data mining and machine learning on POTS. We present practical workflows spanning missingness simulation, data preprocessing, model training, and evaluation across core tasks, including imputation, forecasting, classification, clustering, and anomaly detection. The tutorial consists of two parts: Part I emphasizes hands-on application for practitioners through unified APIs and benchmark-oriented experiments. Part II targets developers and researchers, focusing on extending PyPOTS with custom models, domain-specific constraints, and contribution-ready engineering practices. Participants will…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
