Teaching Social Behavior through Human Reinforcement for Ad hoc Teamwork -The STAR Framework
Shani Alkoby, Avilash Rath, Peter Stone

TL;DR
This paper introduces the STAR framework, a method for teaching autonomous agents to adhere to human social norms within ad hoc teams through human feedback, addressing the challenge of learning complex, non-predefined social behaviors.
Contribution
The paper proposes the STAR framework, enabling multiagent teams to learn social norms via human reinforcement, advancing beyond rule-based approaches to more flexible, norm-based social behavior learning.
Findings
Agents learn to avoid socially unacceptable actions through human feedback.
The STAR framework improves agents' social compliance in team settings.
Demonstrates effective norm adherence in mixed human-agent teams.
Abstract
As AI technology continues to develop, more and more agents will become capable of long term autonomy alongside people. Thus, a recent line of research has studied the problem of teaching autonomous agents the concept of ethics and human social norms. Most existing work considers the case of an individual agent attempting to learn a predefined set of rules. In reality, however, social norms are not always pre-defined and are very difficult to represent algorithmically. Moreover, the basic idea behind the social norms concept is ensuring that one's actions do not negatively influence others' utilities, which is inherently a multiagent concept. Thus, here we investigate a way to teach agents, as a team, how to act according to human social norms. In this research, we introduce the STAR framework used to teach an ad hoc team of agents to act in accordance with human social norms. Using a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Teaching and Learning Programming · Social Robot Interaction and HRI
