RDIS: Random Drop Imputation with Self-Training for Incomplete Time   Series Data

Tae-Min Choi; Ji-Su Kang; Jong-Hwan Kim

arXiv:2010.10075·cs.LG·November 21, 2024·6 cites

RDIS: Random Drop Imputation with Self-Training for Incomplete Time Series Data

Tae-Min Choi, Ji-Su Kang, Jong-Hwan Kim

PDF

Open Access

TL;DR

This paper introduces RDIS, a novel training method for time-series imputation that explicitly trains models by generating extra missing data and refining pseudo values through self-training and entropy filtering.

Contribution

RDIS is the first method to explicitly train imputation models with artificially generated missing data and self-training for improved accuracy.

Findings

01

Achieves competitive results on real-world datasets.

02

Effectively improves imputation accuracy across various models.

03

Utilizes entropy filtering to enhance pseudo value quality.

Abstract

Time-series data with missing values are commonly encountered in many fields, such as healthcare, meteorology, and robotics. The imputation aims to fill the missing values with valid values. Most imputation methods trained the models implicitly because missing values have no ground truth. In this paper, we propose Random Drop Imputation with Self-training (RDIS), a novel training method for time-series data imputation models. In RDIS, we generate extra missing values by applying a random drop on the observed values in incomplete data. We can explicitly train the imputation models by filling in the randomly dropped values. In addition, we adopt self-training with pseudo values to exploit the original missing values. To improve the quality of pseudo values, we set the threshold and filter them by calculating the entropy. To verify the effectiveness of RDIS on the time series imputation,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Stream Mining Techniques · Time Series Analysis and Forecasting · Gaussian Processes and Bayesian Inference

MethodsGated Recurrent Unit