Missing Data using Decision Forest and Computational Intelligence

D. Moon; T. Marwala

arXiv:0812.1615·stat.ML·December 10, 2008·1 cites

Missing Data using Decision Forest and Computational Intelligence

D. Moon, T. Marwala

PDF

Open Access

TL;DR

This paper presents a method combining autoencoder neural networks, genetic algorithms, and decision forests to estimate and improve handling of missing data under the Missing At Random assumption.

Contribution

It introduces a novel approach integrating neural networks, genetic algorithms, and decision forests for missing data estimation and optimization.

Findings

01

Decision forests improve estimation accuracy

02

The combined method reduces mean square error

03

Effective handling of Missing At Random data

Abstract

Autoencoder neural network is implemented to estimate the missing data. Genetic algorithm is implemented for network optimization and estimating the missing data. Missing data is treated as Missing At Random mechanism by implementing maximum likelihood algorithm. The network performance is determined by calculating the mean square error of the network prediction. The network is further optimized by implementing Decision Forest. The impact of missing data is then investigated and decision forrests are found to improve the results.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Mining Algorithms and Applications · Rough Sets and Fuzzy Logic · Face and Expression Recognition