Convergence Analysis of Two-Layer Neural Networks under Gaussian Input Masking

Afroditi Kolomvaki; Fangshuo Liao; Evan Dramko; Ziyun Guang; Anastasios Kyrillidis

arXiv:2602.17423·cs.LG·February 20, 2026

Convergence Analysis of Two-Layer Neural Networks under Gaussian Input Masking

Afroditi Kolomvaki, Fangshuo Liao, Evan Dramko, Ziyun Guang, Anastasios Kyrillidis

PDF

Open Access

TL;DR

This paper analyzes the convergence of two-layer neural networks trained with Gaussian masked inputs, showing linear convergence up to an error bound related to the mask's variance, using NTK analysis.

Contribution

It introduces a novel NTK-based analysis for neural networks with Gaussian input masking, addressing the randomness in non-linear activations.

Findings

01

Achieves linear convergence with Gaussian masked inputs

02

Error region proportional to mask variance

03

Addresses randomness in non-linear activation functions

Abstract

We investigate the convergence guarantee of two-layer neural network training with Gaussian randomly masked inputs. This scenario corresponds to Gaussian dropout at the input level, or noisy input training common in sensor networks, privacy-preserving training, and federated learning, where each user may have access to partial or corrupted features. Using a Neural Tangent Kernel (NTK) analysis, we demonstrate that training a two-layer ReLU network with Gaussian randomly masked inputs achieves linear convergence up to an error region proportional to the mask's variance. A key technical contribution is resolving the randomness within the non-linear activation, a problem of independent interest.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Machine Learning and ELM · Advanced Neural Network Applications