Of Mice and Machines: A Comparison of Learning Between Real World Mice and RL Agents

Shuo Han; German Espinosa; Junda Huang; Daniel A. Dombeck; Malcolm A. MacIver; Bradly C. Stadie

arXiv:2505.12204·cs.LG·October 16, 2025

Of Mice and Machines: A Comparison of Learning Between Real World Mice and RL Agents

Shuo Han, German Espinosa, Junda Huang, Daniel A. Dombeck, Malcolm A. MacIver, Bradly C. Stadie

PDF

Open Access

TL;DR

This paper compares biological mice and reinforcement learning agents in a predator-avoidance task, revealing key behavioral differences and proposing mechanisms to make RL agents exhibit more naturalistic risk-avoidance behaviors.

Contribution

It introduces two novel mechanisms that enhance RL agents' risk assessment and avoidance behaviors, aligning them more closely with biological counterparts.

Findings

01

RL agents lack self-preservation instincts compared to mice.

02

Proposed mechanisms induce naturalistic risk-avoidance behaviors in RL agents.

03

RL agents exhibit strategic environment assessment and cautious planning.

Abstract

Recent advances in reinforcement learning (RL) have demonstrated impressive capabilities in complex decision-making tasks. This progress raises a natural question: how do these artificial systems compare to biological agents, which have been shaped by millions of years of evolution? To help answer this question, we undertake a comparative study of biological mice and RL agents in a predator-avoidance maze environment. Through this analysis, we identify a striking disparity: RL agents consistently demonstrate a lack of self-preservation instinct, readily risking ``death'' for marginal efficiency gains. These risk-taking strategies are in contrast to biological agents, which exhibit sophisticated risk-assessment and avoidance behaviors. Towards bridging this gap between the biological and artificial, we propose two novel mechanisms that encourage more naturalistic risk-avoidance behaviors…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics