Learning When to Ask: Simulation-Trained Humanoids for Mental-Health Diagnosis
Filippo Cenacchi, Deborah Richards, Longbing Cao

TL;DR
This paper introduces a simulation-based training pipeline for humanoid robots to improve mental health screening, focusing on conversational timing, rapport, and nonverbal cues, with a comparative study of control algorithms.
Contribution
It presents a novel agent-centered simulation environment, a safe learning loop for social interaction, and a comparative analysis of control algorithms for humanoid mental health screening.
Findings
TD3 outperformed PPO and CEM in coverage and stability.
Humanoid agents showed improved timing, fewer clarifications, and stable performance under modality dropout.
Simulation results suggest potential for transfer to real humanoid mental health assessments.
Abstract
Testing humanoid robots with users is slow, causes wear, and limits iteration and diversity. Yet screening agents must master conversational timing, prosody, backchannels, and what to attend to in faces and speech for Depression and PTSD. Most simulators omit policy learning with nonverbal dynamics; many controllers chase task accuracy while underweighting trust, pacing, and rapport. We virtualise the humanoid as a conversational agent to train without hardware burden. Our agent-centred, simulation-first pipeline turns interview data into 276 Unreal Engine MetaHuman patients with synchronised speech, gaze/face, and head-torso poses, plus PHQ-8 and PCL-C flows. A perception-fusion-policy loop decides what and when to speak, when to backchannel, and how to avoid interruptions, under a safety shield. Training uses counterfactual replay (bounded nonverbal perturbations) and an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSocial Robot Interaction and HRI · Emotion and Mood Recognition · Digital Mental Health Interventions
