Operator Learning Using Weak Supervision from Walk-on-Spheres
Hrishikesh Viswanath, Hong Chul Nam, Xi Deng, Julius Berner, Anima Anandkumar, Aniket Bera

TL;DR
This paper introduces WoS-NO, a neural operator training method using Monte Carlo-based weak supervision via Walk-on-Spheres, enabling efficient, dataset-free PDE solving with improved accuracy and generalization.
Contribution
It proposes a novel Monte Carlo-based weak supervision scheme using Walk-on-Spheres for training neural operators without precomputed datasets.
Findings
Up to 8.75× better L2-error compared to standard PINNs
Up to 6.31× faster training speed
GPU memory reduction of up to 2.97×
Abstract
Training neural PDE solvers is often bottlenecked by expensive data generation or unstable physics-informed neural network (PINN) involving challenging optimization landscapes due to higher-order derivatives. To tackle this issue, we propose an alternative approach using Monte Carlo approaches to estimate the solution to the PDE as a stochastic process for weak supervision during training. Leveraging the Walk-on-Spheres method, we introduce a learning scheme called \emph{Walk-on-Spheres Neural Operator (WoS-NO)} which uses weak supervision from WoS to train any given neural operator. We propose to amortize the cost of Monte Carlo walks across the distribution of PDE instances using stochastic representations from the WoS algorithm to generate cheap, noisy, estimates of the PDE solution during training. This is formulated into a data-free physics-informed objective where a neural…
Peer Reviews
Decision·Submitted to ICLR 2026
- The paper demonstrates that inexpensive unbiased estimates of the solution can be effective for physics-informed neural operator learning. This aligns with the broader trend of scaling training data using pseudo-labels across wider problem domains. - Some efforts have also been made to extend the framework to nonlinear PDEs. - As the approach relies on pseudo-labeling, the proposed framework remains orthogonal to the choice of neural network architecture.
- Another related approach is PI-DeepONet (Wang et al., 2021), which adopts a PINN-style residual loss. It would be interesting to clarify how WoS-NO compares to this method, especially since PI-DeepONet is less constrained regarding the types of PDEs it can handle. - Given the inherent specificity of the Walk-on-Spheres algorithm, WoS-NO appears applicable only to a limited class of PDEs. It would be valuable to discuss the potential for extending this framework to broader PDE families. Refere
- The experimental results showcase an improvement wrt to baseline. - The proposed method is both GPU-memory and time efficient and performs best on the studied problems.
- My main concerns is about the scope of applicability : in my understanding the proposed method applies only to a limited familly of PDE : Poisson PDE. - It is not detailed why focusing that much on such PDEs is important. - The paper is hard to read, I think some re-writing would help the understanding of the paper (eg in section 2.2 which give a lot of references which makes the paragraph hard to follow). I felt hard to understand the key objectives of the paper. - Some experimental detail
1. **Well-Defined and Significant Problem:** The paper accurately identifies a critical bottleneck in current PDE solving: the high cost of pre-computed data and the instability of PINN optimization. Addressing this issue is crucial for advancing practical applications in scientific computing. 2. **Novelty of the Weakly-Supervised Training Strategy:** The proposed idea of using the WoS stochastic process for weak supervision is pioneering in this field. It cleverly combines the unbiased nature o
1. **Limited Scope of Problem Types:** The method is primarily applied to the family of Poisson equations. Its applicability to a broader range of PDE types (e.g., Navier-Stokes equations, wave equations) remains unverified. The paper should discuss the potential and challenges of extending this framework to other important PDE classes. 2. **Lack of Systematic Study on WoS Parameter Selection:** The choice of the number of WoS trajectories (L ≤ 10) appears empirical. There is a lack of systemati
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Machine Learning in Materials Science · Generative Adversarial Networks and Image Synthesis
