Regression Under Human Assistance
Abir De, Nastaran Okati, Paramita Koley, Niloy Ganguly, Manuel, Gomez-Rodriguez

TL;DR
This paper develops a novel approach for optimizing ridge regression models to operate effectively under varying levels of human assistance, addressing the challenge of decision-making in hybrid human-AI systems.
Contribution
It introduces the problem of ridge regression under human assistance, proves its NP-hardness, and proposes an efficient greedy algorithm with approximation guarantees based on submodularity properties.
Findings
The algorithm effectively outsources high-error samples to humans.
It outperforms several baseline methods in experiments.
The approach is robust across different settings.
Abstract
Decisions are increasingly taken by both humans and machine learning models. However, machine learning models are currently trained for full automation -- they are not aware that some of the decisions may still be taken by humans. In this paper, we take a first step towards the development of machine learning models that are optimized to operate under different automation levels. More specifically, we first introduce the problem of ridge regression under human assistance and show that it is NP-hard. Then, we derive an alternative representation of the corresponding objective function as a difference of nondecreasing submodular functions. Building on this representation, we further show that the objective is nondecreasing and satisfies -submodularity, a recently introduced notion of approximate submodularity. These properties allow a simple and efficient greedy algorithm to enjoy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Machine Learning and Algorithms · Machine Learning and Data Classification
