Learning Minimax Estimators via Online Learning

Kartik Gupta; Arun Sai Suggala; Adarsh Prasad; Praneeth Netrapalli,; Pradeep Ravikumar

arXiv:2006.11430·stat.ML·June 23, 2020

Learning Minimax Estimators via Online Learning

Kartik Gupta, Arun Sai Suggala, Adarsh Prasad, Praneeth Netrapalli,, Pradeep Ravikumar

PDF

Open Access 1 Video

TL;DR

This paper introduces an algorithmic approach to designing minimax estimators by framing the problem as finding a Nash equilibrium in a zero-sum game, leveraging online learning techniques for non-convex losses.

Contribution

It presents a general algorithm for constructing minimax estimators using online learning methods, applicable to classical estimation problems like Gaussian sequence and linear regression.

Findings

01

Algorithm successfully finds minimax estimators in classical models.

02

Provides provably minimax estimators with theoretical guarantees.

03

Demonstrates effectiveness in finite Gaussian sequence and linear regression problems.

Abstract

We consider the problem of designing minimax estimators for estimating the parameters of a probability distribution. Unlike classical approaches such as the MLE and minimum distance estimators, we consider an algorithmic approach for constructing such estimators. We view the problem of designing minimax estimators as finding a mixed strategy Nash equilibrium of a zero-sum game. By leveraging recent results in online learning with non-convex losses, we provide a general algorithm for finding a mixed-strategy Nash equilibrium of general non-convex non-concave zero-sum games. Our algorithm requires access to two subroutines: (a) one which outputs a Bayes estimator corresponding to a given prior probability distribution, and (b) one which computes the worst-case risk of any given estimator. Given access to these two subroutines, we show that our algorithm outputs both a minimax estimator…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Learning Minimax Estimators Via Online Learning· youtube

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Control Systems and Identification