"I Don't Think So": Summarizing Policy Disagreements for Agent Comparison
Yotam Amitai, Ofra Amir

TL;DR
This paper introduces a novel method for generating contrastive summaries that highlight differences between agent policies, improving user understanding and comparison of autonomous agents in human-AI collaboration.
Contribution
The paper presents a new approach for creating dependent, contrastive summaries that emphasize agent policy differences, addressing limitations of existing independent summarization methods.
Findings
Disagreement-based summaries improve user ability to compare agents.
Contrastive summaries outperform independent summaries in conveying agent differences.
User studies validate the effectiveness of the proposed method.
Abstract
With Artificial Intelligence on the rise, human interaction with autonomous agents becomes more frequent. Effective human-agent collaboration requires users to understand the agent's behavior, as failing to do so may cause reduced productivity, misuse or frustration. Agent strategy summarization methods are used to describe the strategy of an agent to its destined user through demonstration. A summary's objective is to maximize the user's understanding of the agent's aptitude by showcasing its behaviour in a selected set of world states. While shown to be useful, we show that current methods are limited when tasked with comparing between agents, as each summary is independently generated for a specific agent. In this paper, we propose a novel method for generating dependent and contrastive summaries that emphasize the differences between agent policies by identifying states in which the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Semantic Web and Ontologies
