CM-DQN: A Value-Based Deep Reinforcement Learning Model to Simulate Confirmation Bias
Jiacheng Shen, Lihan Feng

TL;DR
This paper introduces CM-DQN, a deep reinforcement learning model that simulates human confirmation bias by applying different update strategies for positive and negative prediction errors, demonstrating improved learning effects in experiments.
Contribution
The paper proposes a novel deep reinforcement learning algorithm, CM-DQN, that models confirmation bias by differentially updating based on outcome valence, advancing understanding of human-like decision-making.
Findings
Confirmatory bias improves learning performance.
CM-DQN effectively simulates human decision-making biases.
Experimental results show enhanced learning with confirmation bias.
Abstract
In human decision-making tasks, individuals learn through trials and prediction errors. When individuals learn the task, some are more influenced by good outcomes, while others weigh bad outcomes more heavily. Such confirmation bias can lead to different learning effects. In this study, we propose a new algorithm in Deep Reinforcement Learning, CM-DQN, which applies the idea of different update strategies for positive or negative prediction errors, to simulate the human decision-making process when the task's states are continuous while the actions are discrete. We test in Lunar Lander environment with confirmatory, disconfirmatory bias and non-biased to observe the learning effects. Moreover, we apply the confirmation model in a multi-armed bandit problem (environment in discrete states and discrete actions), which utilizes the same idea as our proposed algorithm, as a contrast…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Adversarial Robustness in Machine Learning · Risk and Safety Analysis
