Towards Emotionally Intelligent and Responsible Reinforcement Learning
Garapati Keerthana, Manik Gupta

TL;DR
This paper introduces a Responsible Reinforcement Learning framework that incorporates emotional understanding and ethical constraints into personalized decision-making, aiming to improve safety and empathy in sensitive domains like healthcare.
Contribution
It formulates personalization as a Constrained Markov Decision Process with a multi-objective reward, integrating emotional and ethical considerations into RL algorithms.
Findings
Proposes a multi-objective reward balancing engagement and well-being
Defines emotion-informed state representations for RL agents
Operationalizes empathy and responsibility in machine learning policies
Abstract
Personalized decision systems in healthcare and behavioral support often rely on static rule-based or engagement-maximizing heuristics that overlook users' emotional context and ethical constraints. Such approaches risk recommending insensitive or unsafe interventions, especially in domains involving serious mental illness, substance use disorders, or depression. To address this limitation, we propose a Responsible Reinforcement Learning (RRL) framework that integrates emotional and contextual understanding with ethical considerations into the sequential decision-making process. RRL formulates personalization as a Constrained Markov Decision Process (CMDP), where the agent optimizes engagement and adherence while ensuring emotional alignment and ethical safety. We introduce a multi-objective reward function that explicitly balances short-term behavioral engagement with long-term user…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Mental Health Interventions · Mental Health via Writing · Emotion and Mood Recognition
