A Common-Factor Approach for Multivariate Data Cleaning with an Application to Mars Phoenix Mission Data
Dongping Fang, Elizabeth Oberlin, Wei Ding, Samuel P. Kounaves

TL;DR
This paper introduces a common-factor data cleaning method that analyzes multivariate data collectively to identify and remove external interference, improving data reliability in scientific exploration contexts like Mars missions.
Contribution
It proposes a novel data-driven approach that captures shared underlying factors across multiple signals to enhance data cleaning without distorting true measurements.
Findings
Effective removal of external impacts from multivariate data
Preservation of the original data mean levels
Improved data quality for scientific analysis
Abstract
Data quality is fundamentally important to ensure the reliability of data for stakeholders to make decisions. In real world applications, such as scientific exploration of extreme environments, it is unrealistic to require raw data collected to be perfect. As data miners, when it is infeasible to physically know the why and the how in order to clean up the data, we propose to seek the intrinsic structure of the signal to identify the common factors of multivariate data. Using our new data driven learning method, the common-factor data cleaning approach, we address an interdisciplinary challenge on multivariate data cleaning when complex external impacts appear to interfere with multiple data measurements. Existing data analyses typically process one signal measurement at a time without considering the associations among all signals. We analyze all signal measurements simultaneously to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoil Geostatistics and Mapping · Advanced Statistical Methods and Models · Geochemistry and Geologic Mapping
