Zero-Shot Assistance in Sequential Decision Problems

Sebastiaan De Peuter; Samuel Kaski

arXiv:2202.07364·cs.LG·December 1, 2022

Zero-Shot Assistance in Sequential Decision Problems

Sebastiaan De Peuter, Samuel Kaski

PDF

Open Access 2 Repos

TL;DR

This paper introduces a formal framework and scalable method for creating advisory assistants in sequential decision problems that adapt to agent biases, improving overall reward compared to automation-based approaches.

Contribution

It presents a novel formalization of assistance accounting for agent biases and a scalable planning method for advisors in complex decision tasks.

Findings

01

The approach adapts to agent biases effectively.

02

Advisory assistance yields higher rewards than automation-based methods.

03

Combining advice with automation increases performance but reduces safety guarantees.

Abstract

We consider the problem of creating assistants that can help agents solve new sequential decision problems, assuming the agent is not able to specify the reward function explicitly to the assistant. Instead of acting in place of the agent as in current automation-based approaches, we give the assistant an advisory role and keep the agent in the loop as the main decision maker. The difficulty is that we must account for potential biases of the agent which may cause it to seemingly irrationally reject advice. To do this we introduce a novel formalization of assistance that models these biases, allowing the assistant to infer and adapt to them. We then introduce a new method for planning the assistant's actions which can scale to large decision making problems. We show experimentally that our approach adapts to these agent biases, and results in higher cumulative reward for the agent than…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAuction Theory and Applications · Law, Economics, and Judicial Systems · Blockchain Technology Applications and Security