Innate-Values-driven Reinforcement Learning based Cognitive Modeling

Qin Yang

arXiv:2411.09160·cs.AI·June 11, 2025

Innate-Values-driven Reinforcement Learning based Cognitive Modeling

Qin Yang

PDF

Open Access

TL;DR

This paper introduces a novel reinforcement learning model driven by innate values, enabling agents to better balance internal needs and external rewards, leading to improved performance in complex tasks.

Contribution

It proposes the innate-values-driven RL (IVRL) framework and two models, IV-DQN and IV-A2C, integrating intrinsic motivations into RL for more human-like decision-making.

Findings

01

IVRL models outperform benchmark algorithms in VIZDoom RPG tasks.

02

IVRL enables better internal needs management and goal organization.

03

Models demonstrate improved learning efficiency and adaptability.

Abstract

Innate values describe agents' intrinsic motivations, which reflect their inherent interests and preferences for pursuing goals and drive them to develop diverse skills that satisfy their various needs. Traditional reinforcement learning (RL) is learning from interaction based on the feedback rewards of the environment. However, in real scenarios, the rewards are generated by agents' innate value systems, which differ vastly from individuals based on their needs and requirements. In other words, considering the AI agent as a self-organizing system, developing its awareness through balancing internal and external utilities based on its needs in different tasks is a crucial problem for individuals learning to support others and integrate community with safety and harmony in the long term. To address this gap, we propose a new RL model termed innate-values-driven RL (IVRL) based on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsEntropy Regularization · Convolution · Dense Connections · Proximal Policy Optimization · Q-Learning · A2C · Deep Q-Network