Conservative Bias in Multi-Teacher Learning: Why Agents Prefer Low-Reward Advisors

Maher Mesto; Francisco Cruz

arXiv:2512.17180·cs.RO·December 24, 2025

Conservative Bias in Multi-Teacher Learning: Why Agents Prefer Low-Reward Advisors

Maher Mesto, Francisco Cruz

PDF

Open Access

TL;DR

This paper uncovers that in interactive reinforcement learning, agents prefer conservative, low-reward teachers over higher-reward ones, highlighting a bias towards safety and consistency that impacts learning dynamics and decision-making.

Contribution

The study reveals a strong conservative bias in agent teacher selection in IRL, supported by extensive experiments, and discusses implications for human-robot collaboration and safety-critical applications.

Findings

01

Agents prefer low-reward teachers 93.16% of the time.

02

Critical thresholds at teacher availability rho >= 0.6 and accuracy omega >= 0.6.

03

Framework improves performance by 159% over baseline Q-learning.

Abstract

Interactive reinforcement learning (IRL) has shown promise in enabling autonomous agents and robots to learn complex behaviours from human teachers, yet the dynamics of teacher selection remain poorly understood. This paper reveals an unexpected phenomenon in IRL: when given a choice between teachers with different reward structures, learning agents overwhelmingly prefer conservative, low-reward teachers (93.16% selection rate) over those offering 20x higher rewards. Through 1,250 experimental runs in navigation tasks with multiple expert teachers, we discovered: (1) Conservative bias dominates teacher selection: agents systematically choose the lowest-reward teacher, prioritising consistency over optimality; (2) Critical performance thresholds exist at teacher availability rho >= 0.6 and accuracy omega >= 0.6, below which the framework fails catastrophically; (3) The framework achieves…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Social Robot Interaction and HRI · Robot Manipulation and Learning