Adaptive Human-Computer Interaction Strategies Through Reinforcement Learning in Complex
Rui Liu, Yifan Zhuang, Runsheng Zhang

TL;DR
This paper introduces a reinforcement learning framework for adaptive human-computer interaction, modeling it as a Markov decision process to optimize long-term user experience and interaction efficiency.
Contribution
It presents a novel RL-based optimization method that dynamically adjusts interaction strategies, outperforming existing approaches in complex, multimodal dialog environments.
Findings
Outperforms existing methods in task success rate
Achieves higher cumulative rewards and faster convergence
Enhances interaction stability and efficiency
Abstract
This study addresses the challenges of dynamics and complexity in intelligent human-computer interaction and proposes a reinforcement learning-based optimization framework to improve long-term returns and overall experience. Human-computer interaction is modeled as a Markov decision process, with state space, action space, reward function, and discount factor defined to capture the dynamics of user input, system feedback, and interaction environment. The method combines policy function, value function, and advantage function, updates parameters through policy gradient, and continuously adjusts during interaction to balance immediate feedback and long-term benefits. To validate the framework, multimodal dialog and scene-aware datasets are used as the experimental platform, with multiple sensitivity experiments conducted on key factors such as discount factor, exploration rate decay,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Social Robot Interaction and HRI · Human Motion and Animation
