Conservative Bias in Multi-Teacher Learning: Why Agents Prefer Low-Reward Advisors
Maher Mesto, Francisco Cruz

TL;DR
This paper uncovers that in interactive reinforcement learning, agents prefer conservative, low-reward teachers over higher-reward ones, highlighting a bias towards safety and consistency that impacts learning dynamics and decision-making.
Contribution
The study reveals a strong conservative bias in agent teacher selection in IRL, supported by extensive experiments, and discusses implications for human-robot collaboration and safety-critical applications.
Findings
Agents prefer low-reward teachers 93.16% of the time.
Critical thresholds at teacher availability rho >= 0.6 and accuracy omega >= 0.6.
Framework improves performance by 159% over baseline Q-learning.
Abstract
Interactive reinforcement learning (IRL) has shown promise in enabling autonomous agents and robots to learn complex behaviours from human teachers, yet the dynamics of teacher selection remain poorly understood. This paper reveals an unexpected phenomenon in IRL: when given a choice between teachers with different reward structures, learning agents overwhelmingly prefer conservative, low-reward teachers (93.16% selection rate) over those offering 20x higher rewards. Through 1,250 experimental runs in navigation tasks with multiple expert teachers, we discovered: (1) Conservative bias dominates teacher selection: agents systematically choose the lowest-reward teacher, prioritising consistency over optimality; (2) Critical performance thresholds exist at teacher availability rho >= 0.6 and accuracy omega >= 0.6, below which the framework fails catastrophically; (3) The framework achieves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Social Robot Interaction and HRI · Robot Manipulation and Learning
