RecoMind: A Reinforcement Learning Framework for Optimizing In-Session User Satisfaction in Recommendation Systems
Mehdi Ben Ayed, Fei Feng, Jay Adams, Vishwakarma Singh, Kritarth Anand, Jiajing Xu

TL;DR
RecoMind is a scalable reinforcement learning framework that enhances in-session user satisfaction in recommendation systems by leveraging simulation-based training and custom exploration strategies, outperforming traditional supervised methods.
Contribution
The paper introduces RecoMind, a novel simulation-based RL framework that simplifies training and deployment of RL policies at web-scale for session-based recommendation optimization.
Findings
RL policy significantly increases user engagement metrics.
Offline and online tests show RecoMind outperforms supervised learning approaches.
Efficient exploration strategy enables handling web-scale action spaces.
Abstract
Existing web-scale recommendation systems commonly use supervised learning methods that prioritize immediate user feedback. Although reinforcement learning (RL) offers a solution to optimize longer-term goals, such as in-session engagement, applying it at web scale is challenging due to the extremely large action space and engineering complexity. In this paper, we introduce RecoMind, a simulator-based RL framework designed for the effective optimization of session-based goals at web-scale. RecoMind leverages existing recommendation models to establish a simulation environment and to bootstrap the RL policy to optimize immediate user interactions from the outset. This method integrates well with existing industry pipelines, simplifying the training and deployment of RL policies. Additionally, RecoMind introduces a custom exploration strategy to efficiently explore web-scale action spaces…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Digital Mental Health Interventions · Emotion and Mood Recognition
