Fast Nonseparable Gaussian Stochastic Process with Application to Methylation Level Interpolation
Mengyang Gu, Yanxun Xu

TL;DR
This paper introduces a fast, exact nonseparable Gaussian stochastic process model that significantly reduces computational complexity from cubic to linear, enabling large-scale methylation level interpolation with high accuracy.
Contribution
The paper proposes a novel nonseparable GaSP model with a linear computational algorithm, extending the applicability of GaSP to large datasets without approximation.
Findings
Accurately predicts genome-wide DNA methylation levels.
Computational complexity reduced from O(n^3) to O(n).
Outperforms linear regression, random forest, and localized Kriging.
Abstract
Gaussian stochastic process (GaSP) has been widely used as a prior over functions due to its flexibility and tractability in modeling. However, the computational cost in evaluating the likelihood is , where is the number of observed points in the process, as it requires to invert the covariance matrix. This bottleneck prevents GaSP being widely used in large-scale data. We propose a general class of nonseparable GaSP models for multiple functional observations with a fast and exact algorithm, in which the computation is linear () and exact, requiring no approximation to compute the likelihood. We show that the commonly used linear regression and separable models are special cases of the proposed nonseparable GaSP model. Through the study of an epigenetic application, the proposed nonseparable GaSP model can accurately predict the genome-wide DNA methylation levels and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
