Denoising Data with Measurement Error Using a Reproducing Kernel-based Diffusion Model
Mingyang Yi, Marcos Matabuena, Ruoyu Wang

TL;DR
This paper introduces a novel kernel-based diffusion model framework for denoising measurement error-affected data, enabling more accurate analysis in fields like healthcare and biology.
Contribution
It develops a Reproducing Kernel Hilbert Space-based method that trains diffusion models with error-contaminated data and provides theoretical guarantees and fast convergence.
Findings
The method achieves a closed-form solution and fast convergence.
It provides an upper bound on the divergence between denoised and true data distributions.
Simulations and real-world application demonstrate superior performance.
Abstract
The ongoing technological revolution in measurement systems enables the acquisition of high-resolution samples in fields such as engineering, biology, and medicine. However, these observations are often subject to errors from measurement devices. Motivated by this challenge, we propose a denoising framework that employs diffusion models to generate denoised data whose distribution closely approximates the unobservable, error-free data, thereby permitting standard data analysis based on the denoised data. The key element of our framework is a novel Reproducing Kernel Hilbert Space-based method that trains the diffusion model with only error-contaminated data, admits a closed-form solution, and achieves a fast convergence rate in terms of estimation error. Furthermore, we verify the effectiveness of our method by deriving an upper bound on the Kullback--Leibler divergence between the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Signal Denoising Methods
