Learning Minimax Estimators via Online Learning
Kartik Gupta, Arun Sai Suggala, Adarsh Prasad, Praneeth Netrapalli,, Pradeep Ravikumar

TL;DR
This paper introduces an algorithmic approach to designing minimax estimators by framing the problem as finding a Nash equilibrium in a zero-sum game, leveraging online learning techniques for non-convex losses.
Contribution
It presents a general algorithm for constructing minimax estimators using online learning methods, applicable to classical estimation problems like Gaussian sequence and linear regression.
Findings
Algorithm successfully finds minimax estimators in classical models.
Provides provably minimax estimators with theoretical guarantees.
Demonstrates effectiveness in finite Gaussian sequence and linear regression problems.
Abstract
We consider the problem of designing minimax estimators for estimating the parameters of a probability distribution. Unlike classical approaches such as the MLE and minimum distance estimators, we consider an algorithmic approach for constructing such estimators. We view the problem of designing minimax estimators as finding a mixed strategy Nash equilibrium of a zero-sum game. By leveraging recent results in online learning with non-convex losses, we provide a general algorithm for finding a mixed-strategy Nash equilibrium of general non-convex non-concave zero-sum games. Our algorithm requires access to two subroutines: (a) one which outputs a Bayes estimator corresponding to a given prior probability distribution, and (b) one which computes the worst-case risk of any given estimator. Given access to these two subroutines, we show that our algorithm outputs both a minimax estimator…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Learning Minimax Estimators Via Online Learning· youtube
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Control Systems and Identification
