Targeted Data Acquisition for Evolving Negotiation Agents
Minae Kwon, Siddharth Karamcheti, Mariano-Florentino Cuellar, Dorsa, Sadigh

TL;DR
This paper presents a targeted data acquisition framework that guides reinforcement learning agents in negotiation tasks, enabling them to balance self-interest and cooperation more effectively than traditional methods.
Contribution
Introduces a guided exploration approach using expert annotations to improve negotiation strategies in reinforcement learning agents.
Findings
Agents achieve higher utility and cooperation levels.
Outperforms standard supervised and reinforcement learning methods.
Effective with both simulated and human partners.
Abstract
Successful negotiators must learn how to balance optimizing for self-interest and cooperation. Yet current artificial negotiation agents often heavily depend on the quality of the static datasets they were trained on, limiting their capacity to fashion an adaptive response balancing self-interest and cooperation. For this reason, we find that these agents can achieve either high utility or cooperation, but not both. To address this, we introduce a targeted data acquisition framework where we guide the exploration of a reinforcement learning agent using annotations from an expert oracle. The guided exploration incentivizes the learning agent to go beyond its static dataset and develop new negotiation strategies. We show that this enables our agents to obtain higher-reward and more Pareto-optimal solutions when negotiating with both simulated and human partners compared to standard…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · Multi-Agent Systems and Negotiation · Experimental Behavioral Economics Studies
