TL;DR
This survey explores the evolution, patterns, and challenges of multi-agent video recommender systems, highlighting recent advances with large language models and outlining future research directions.
Contribution
It provides a comprehensive taxonomy of multi-agent video recommenders, analyzes coordination mechanisms, and discusses emerging LLM-powered architectures and open challenges.
Findings
Multi-agent architectures improve recommendation explainability.
LLM integration enables more dynamic and personalized recommendations.
Open challenges include scalability and multimodal understanding.
Abstract
Video recommender systems are among the most popular and impactful applications of AI, shaping content consumption and influencing culture for billions of users. Traditional single-model recommenders, which optimize static engagement metrics, are increasingly limited in addressing the dynamic requirements of modern platforms. In response, multi-agent architectures are redefining how video recommender systems serve, learn, and adapt to both users and datasets. These agent-based systems coordinate specialized agents responsible for video understanding, reasoning, memory, and feedback, to provide precise, explainable recommendations. In this survey, we trace the evolution of multi-agent video recommendation systems (MAVRS). We combine ideas from multi-agent recommender systems, foundation models, and conversational AI, culminating in the emerging field of large language model (LLM)-powered…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
