Missing Data using Decision Forest and Computational Intelligence
D. Moon, T. Marwala

TL;DR
This paper presents a method combining autoencoder neural networks, genetic algorithms, and decision forests to estimate and improve handling of missing data under the Missing At Random assumption.
Contribution
It introduces a novel approach integrating neural networks, genetic algorithms, and decision forests for missing data estimation and optimization.
Findings
Decision forests improve estimation accuracy
The combined method reduces mean square error
Effective handling of Missing At Random data
Abstract
Autoencoder neural network is implemented to estimate the missing data. Genetic algorithm is implemented for network optimization and estimating the missing data. Missing data is treated as Missing At Random mechanism by implementing maximum likelihood algorithm. The network performance is determined by calculating the mean square error of the network prediction. The network is further optimized by implementing Decision Forest. The impact of missing data is then investigated and decision forrests are found to improve the results.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Rough Sets and Fuzzy Logic · Face and Expression Recognition
