User Simulator-Guided Multi-Turn Preference Optimization for Reasoning LLM-based Conversational Recommendation