Nonparametric inference with massive data via grouped empirical likelihood
Yongda Wang, Shifeng Xiong

TL;DR
This paper introduces a grouped empirical likelihood (GEL) method that significantly reduces computational complexity for massive data analysis while maintaining the same inferential accuracy as traditional empirical likelihood methods.
Contribution
It proposes a novel GEL approach that groups data to lower optimization complexity and demonstrates its effectiveness through theoretical proofs and practical experiments.
Findings
GEL achieves the same asymptotic properties as traditional empirical likelihood.
GEL provides substantial computational speedup, analyzing billion-scale data in seconds.
Numerical simulations confirm GEL's accuracy and efficiency.
Abstract
To address the computational issue in empirical likelihood methods with massive data, this paper proposes a grouped empirical likelihood (GEL) method. It divides observations into groups, and assigns the same probability weight to all observations within the same group. GEL estimates the weights by maximizing the empirical likelihood ratio. The dimensionality of the optimization problem is thus reduced from to , thereby lowering the computational complexity. We prove that GEL possesses the same first order asymptotic properties as the conventional empirical likelihood method under the estimating equation settings and the classical two-sample mean problem. A distributed GEL method is also proposed with several servers. Numerical simulations and real data analysis demonstrate that GEL can keep the same inferential accuracy as the conventional empirical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Statistical Methods and Inference · Gaussian Processes and Bayesian Inference
