SHARPIE: A Modular Framework for Reinforcement Learning and Human-AI Interaction Experiments
H\"useyin Ayd{\i}n, Kevin Godin-Dubois, Libio Goncalvez Braz, Floris, den Hengst, Kim Baraka, Mustafa Mert \c{C}elikok, Andreas Sauter, Shihan, Wang, Frans A. Oliehoek

TL;DR
SHARPIE is a flexible, modular platform designed to facilitate research on human-AI interaction in reinforcement learning, supporting experiments with diverse interaction paradigms and scalable deployment.
Contribution
It introduces a comprehensive, modular framework that standardizes and streamlines experiments involving human-AI interactions in reinforcement learning.
Findings
Supports a wide range of human-AI interaction experiments
Enables scalable deployment and recruitment for studies
Standardizes interfaces for human-RL interactions
Abstract
Reinforcement learning (RL) offers a general approach for modeling and training AI agents, including human-AI interaction scenarios. In this paper, we propose SHARPIE (Shared Human-AI Reinforcement Learning Platform for Interactive Experiments) to address the need for a generic framework to support experiments with RL agents and humans. Its modular design consists of a versatile wrapper for RL environments and algorithm libraries, a participant-facing web interface, logging utilities, deployment on popular cloud and participant recruitment platforms. It empowers researchers to study a wide variety of research questions related to the interaction between humans and RL agents, including those related to interactive reward specification and learning, learning from human feedback, action delegation, preference elicitation, user-modeling, and human-AI teaming. The platform is based on a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
