Interpretable Multi-Source Data Fusion Through Latent Variable Gaussian Process
Sandipp Krishnan Ravi, Yigitcan Comlek, Arjun Pathak, Vipul Gupta,, Rajnikant Umretiya, Andrew Hoffman, Ghanshyam Pilania, Piyush Pandita, Sayan, Ghosh, Nathaniel Mckeever, Wei Chen, Liping Wang

TL;DR
This paper introduces a novel multi-source data fusion framework using Latent Variable Gaussian Processes that effectively combines diverse data sources into a single, interpretable predictive model, improving accuracy in sparse-data scenarios.
Contribution
The paper proposes a source-aware data fusion method based on LVGP that accounts for differences in data quality and interpretability, addressing limitations of existing fusion techniques.
Findings
Enhanced prediction accuracy over single-source models
Effective handling of sparse data problems
Ability to interpret source differences via latent variables
Abstract
With the advent of artificial intelligence and machine learning, various domains of science and engineering communities have leveraged data-driven surrogates to model complex systems through fusing numerous sources of information (data) from published papers, patents, open repositories, or other resources. However, not much attention has been paid to the differences in quality and comprehensiveness of the known and unknown underlying physical parameters of the information sources, which could have downstream implications during system optimization. Additionally, existing methods cannot fuse multi-source data into a single predictive model. Towards resolving this issue, a multi-source data fusion framework based on Latent Variable Gaussian Process (LVGP) is proposed. The individual data sources are tagged as a characteristic categorical variable that are mapped into a physically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Time Series Analysis and Forecasting
MethodsGaussian Process
