RLSLM: A Hybrid Reinforcement Learning Framework Aligning Rule-Based Social Locomotion Model with Human Social Norms
Yitian Kou, Yihe Gu, Chen Zhou, DanDan Zhu, Shuguang Kuai

TL;DR
RLSLM is a hybrid reinforcement learning framework that combines rule-based social norms with data-driven optimization to enable socially-aware navigation in human environments, improving comfort and interpretability.
Contribution
The paper introduces RLSLM, a novel hybrid framework integrating a rule-based social comfort model into reinforcement learning for more effective social navigation.
Findings
RLSLM outperforms state-of-the-art rule-based models in user experience.
The model achieves socially aligned navigation with minimal training.
Enhanced interpretability over conventional data-driven methods.
Abstract
Navigating human-populated environments without causing discomfort is a critical capability for socially-aware agents. While rule-based approaches offer interpretability through predefined psychological principles, they often lack generalizability and flexibility. Conversely, data-driven methods can learn complex behaviors from large-scale datasets, but are typically inefficient, opaque, and difficult to align with human intuitions. To bridge this gap, we propose RLSLM, a hybrid Reinforcement Learning framework that integrates a rule-based Social Locomotion Model, grounded in empirical behavioral experiments, into the reward function of a reinforcement learning framework. The social locomotion model generates an orientation-sensitive social comfort field that quantifies human comfort across space, enabling socially aligned navigation policies with minimal training. RLSLM then jointly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpatial Cognition and Navigation · Social Robot Interaction and HRI · Evacuation and Crowd Dynamics
