Oracle Efficient Algorithms for Groupwise Regret

Krishna Acharya; Eshwar Ram Arunachaleswaran; Sampath Kannan; Aaron; Roth; Juba Ziani

arXiv:2310.04652·cs.LG·October 10, 2023

Oracle Efficient Algorithms for Groupwise Regret

Krishna Acharya, Eshwar Ram Arunachaleswaran, Sampath Kannan, Aaron, Roth, Juba Ziani

PDF

Open Access 1 Video 3 Reviews

TL;DR

This paper introduces an efficient algorithm for online prediction that guarantees low regret across multiple intersecting groups, improving computational feasibility and performance in complex, real-world datasets.

Contribution

It presents a simple modification of sleeping experts technique to achieve groupwise regret guarantees with linear runtime in the number of groups, scalable to large models.

Findings

01

Algorithm achieves similar regret guarantees to prior work but with improved efficiency.

02

Runs in linear time relative to the number of groups and is oracle-efficient in the hypothesis class.

03

Experimental results show substantial error reductions across groups on real datasets.

Abstract

We study the problem of online prediction, in which at each time step $t$ , an individual $x_{t}$ arrives, whose label we must predict. Each individual is associated with various groups, defined based on their features such as age, sex, race etc., which may intersect. Our goal is to make predictions that have regret guarantees not just overall but also simultaneously on each sub-sequence comprised of the members of any single group. Previous work such as [Blum & Lykouris] and [Lee et al] provide attractive regret guarantees for these problems; however, these are computationally intractable on large model classes. We show that a simple modification of the sleeping experts technique of [Blum & Lykouris] yields an efficient reduction to the well-understood problem of obtaining diminishing external regret absent group considerations. Our approach gives similar regret guarantees compared to…

Peer Reviews

Decision·ICLR 2024 poster

Reviewer 01Rating 3· reject, not good enoughConfidence 4

Strengths

The topic is relevant to the machine learning community as it reflects the multi-objective nature of online prediction problems. The experimental results show the proposed method works well in practice. The experimental results show the superiority of the proposed method against baselines.

Weaknesses

I am afraid that the proof of the main theorem (Theorem 2) might be wrong, or at least incomplete. Simply put, the algorithm aggregates sleeping experts where each sleeping expert is awake only when the trial belongs to a designated subsequence of trials. Then, the theorem trivially holds when each subsequence of trials is disjoint to each other, as mentioned in the paper. On the other hand, if subsequences intersect, it is not fully clear if the proof is correct.

Reviewer 02Rating 8· accept, good paperConfidence 3

Strengths

While building on prior work by Blum & Lykouris (2019), the paper introduces a simple yet meaningful modification that enhances computational efficiency. This facilitates the use of large hypothesis classes, such as linear models. The reduction to standard external regret minimization, while expected, remains theoretically novel. The algorithm's design and analysis are technically sound, and the paper offers a comprehensive set of experiments. The paper is well-written and easy to follow, with

Weaknesses

I believe that including an experimental comparison with the Blum & Lykouris (2019) approach would better justify the superiority of the new method. Is it possible to conduct such a comparison using a smaller model class?

Reviewer 03Rating 8· accept, good paperConfidence 4

Strengths

Overall, I like this paper as it provides the practical algorithm for a problem where the existence of the solution (or even, a theoretically-efficient algorithm) has been known, but no practically-efficient algorithm was proposed. This serves as a nice ``bridge’’ between theory and practice. As the paper itself has mentioned, practical group-wise regret-minimization algorithms have various downstream applications, including algorithmic fairness. I also read through the analysis in Appendix B,

Weaknesses

One criticism I have when reading the paper is that the paper is not presented in a fully self-contained and rigorous manner. For instance, the proposed algorithm uses AdaNormalHedge as a black box, but the guarantee of such an algorithm is never formally described. (I am aware of the description in Appendix B, but there is no proposition + proper citation for this.) Similarly, when citing external regret minimization algorithms for applications, the formal quantifiers and guarantees for those a

Videos

Oracle Efficient Algorithms for Groupwise Regret· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Misinformation and Its Impacts · Advanced Causal Inference Techniques

MethodsLinear Regression