E-HBA: Using Action Policies for Expert Advice and Agent Typification
Stefano V. Albrecht, Jacob W. Crandall, Subramanian Ramamoorthy

TL;DR
E-HBA is a novel meta-algorithm that combines expert advice and agent typification to enhance decision-making in repeated interactions, showing significant performance improvements in matrix games.
Contribution
The paper introduces E-HBA, a new meta-algorithm that integrates expert payoff history with type-based predictions to improve existing expert algorithms.
Findings
E-HBA significantly improves expert algorithm performance in matrix games.
Combining past payoffs with type predictions enhances decision accuracy.
E-HBA is applicable to any expert algorithm considering historical payoffs.
Abstract
Past research has studied two approaches to utilise predefined policy sets in repeated interactions: as experts, to dictate our own actions, and as types, to characterise the behaviour of other agents. In this work, we bring these complementary views together in the form of a novel meta-algorithm, called Expert-HBA (E-HBA), which can be applied to any expert algorithm that considers the average (or total) payoff an expert has yielded in the past. E-HBA gradually mixes the past payoff with a predicted future payoff, which is computed using the type-based characterisation. We present results from a comprehensive set of repeated matrix games, comparing the performance of several well-known expert algorithms with and without the aid of E-HBA. Our results show that E-HBA has the potential to significantly improve the performance of expert algorithms.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation · Reinforcement Learning in Robotics · Auction Theory and Applications
