Human Decision Makings on Curriculum Reinforcement Learning with Difficulty Adjustment
Yilei Zeng, Jiali Duan, Yang Li, Emilio Ferrara, Lerrel Pinto, C.-C., Jay Kuo, Stefanos Nikolaidis

TL;DR
This paper introduces an interactive platform for human-in-the-loop curriculum reinforcement learning, enabling personalized difficulty adjustment to match human preferences and improve learning efficiency.
Contribution
It presents a portable, parallelizable system that allows humans to guide RL training by manipulating difficulty, demonstrating effective personalized curriculum adaptation.
Findings
Reinforcement learning performance aligns with human difficulty preferences.
The system enables large-scale RL training without a server.
Interactive curriculum improves learning efficiency and personalization.
Abstract
Human-centered AI considers human experiences with AI performance. While abundant research has been helping AI achieve superhuman performance either by fully automatic or weak supervision learning, fewer endeavors are experimenting with how AI can tailor to humans' preferred skill level given fine-grained input. In this work, we guide the curriculum reinforcement learning results towards a preferred performance level that is neither too hard nor too easy via learning from the human decision process. To achieve this, we developed a portable, interactive platform that enables the user to interact with agents online via manipulating the task difficulty, observing performance, and providing curriculum feedback. Our system is highly parallelizable, making it possible for a human to train large-scale reinforcement learning applications that require millions of samples without a server. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
