Oracle Efficient Algorithms for Groupwise Regret
Krishna Acharya, Eshwar Ram Arunachaleswaran, Sampath Kannan, Aaron, Roth, Juba Ziani

TL;DR
This paper introduces an efficient algorithm for online prediction that guarantees low regret across multiple intersecting groups, improving computational feasibility and performance in complex, real-world datasets.
Contribution
It presents a simple modification of sleeping experts technique to achieve groupwise regret guarantees with linear runtime in the number of groups, scalable to large models.
Findings
Algorithm achieves similar regret guarantees to prior work but with improved efficiency.
Runs in linear time relative to the number of groups and is oracle-efficient in the hypothesis class.
Experimental results show substantial error reductions across groups on real datasets.
Abstract
We study the problem of online prediction, in which at each time step , an individual arrives, whose label we must predict. Each individual is associated with various groups, defined based on their features such as age, sex, race etc., which may intersect. Our goal is to make predictions that have regret guarantees not just overall but also simultaneously on each sub-sequence comprised of the members of any single group. Previous work such as [Blum & Lykouris] and [Lee et al] provide attractive regret guarantees for these problems; however, these are computationally intractable on large model classes. We show that a simple modification of the sleeping experts technique of [Blum & Lykouris] yields an efficient reduction to the well-understood problem of obtaining diminishing external regret absent group considerations. Our approach gives similar regret guarantees compared to…
Peer Reviews
Decision·ICLR 2024 poster
The topic is relevant to the machine learning community as it reflects the multi-objective nature of online prediction problems. The experimental results show the proposed method works well in practice. The experimental results show the superiority of the proposed method against baselines.
I am afraid that the proof of the main theorem (Theorem 2) might be wrong, or at least incomplete. Simply put, the algorithm aggregates sleeping experts where each sleeping expert is awake only when the trial belongs to a designated subsequence of trials. Then, the theorem trivially holds when each subsequence of trials is disjoint to each other, as mentioned in the paper. On the other hand, if subsequences intersect, it is not fully clear if the proof is correct.
While building on prior work by Blum & Lykouris (2019), the paper introduces a simple yet meaningful modification that enhances computational efficiency. This facilitates the use of large hypothesis classes, such as linear models. The reduction to standard external regret minimization, while expected, remains theoretically novel. The algorithm's design and analysis are technically sound, and the paper offers a comprehensive set of experiments. The paper is well-written and easy to follow, with
I believe that including an experimental comparison with the Blum & Lykouris (2019) approach would better justify the superiority of the new method. Is it possible to conduct such a comparison using a smaller model class?
Overall, I like this paper as it provides the practical algorithm for a problem where the existence of the solution (or even, a theoretically-efficient algorithm) has been known, but no practically-efficient algorithm was proposed. This serves as a nice ``bridge’’ between theory and practice. As the paper itself has mentioned, practical group-wise regret-minimization algorithms have various downstream applications, including algorithmic fairness. I also read through the analysis in Appendix B,
One criticism I have when reading the paper is that the paper is not presented in a fully self-contained and rigorous manner. For instance, the proposed algorithm uses AdaNormalHedge as a black box, but the guarantee of such an algorithm is never formally described. (I am aware of the description in Appendix B, but there is no proposition + proper citation for this.) Similarly, when citing external regret minimization algorithms for applications, the formal quantifiers and guarantees for those a
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Misinformation and Its Impacts · Advanced Causal Inference Techniques
MethodsLinear Regression
