On the KL-Divergence-based Robust Satisficing Model

Haojie Yan; Minglong Zhou; Jiayi Guo

arXiv:2408.09157·cs.LG·August 20, 2024

On the KL-Divergence-based Robust Satisficing Model

Haojie Yan, Minglong Zhou, Jiayi Guo

PDF

Open Access

TL;DR

This paper introduces a KL-divergence-based robust satisficing model that improves machine learning robustness, especially for deep neural networks, by providing analytical insights, efficient algorithms, and demonstrating superior performance across multiple tasks.

Contribution

It extends the robust satisficing framework to general loss functions, offering analytical interpretations, convergence analysis, and scalable methods for complex models like deep neural networks.

Findings

01

Outperforms state-of-the-art benchmarks in three machine learning tasks.

02

Provides stable and efficient numerical algorithms with convergence guarantees.

03

Extends the model to hierarchical data structures.

Abstract

Empirical risk minimization, a cornerstone in machine learning, is often hindered by the Optimizer's Curse stemming from discrepancies between the empirical and true data-generating distributions.To address this challenge, the robust satisficing framework has emerged recently to mitigate ambiguity in the true distribution. Distinguished by its interpretable hyperparameter and enhanced performance guarantees, this approach has attracted increasing attention from academia. However, its applicability in tackling general machine learning problems, notably deep neural networks, remains largely unexplored due to the computational challenges in solving this model efficiently across general loss functions. In this study, we delve into the Kullback Leibler divergence based robust satisficing model under a general loss function, presenting analytical interpretations, diverse performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMulti-Criteria Decision Making · Forecasting Techniques and Applications · Collaboration in agile enterprises

MethodsSoftmax · Attention Is All You Need