Transparent and Controllable Recommendation Filtering via Multimodal Multi-Agent Collaboration
Chi Zhang, Zhipeng Xu, Jiahao Liu, Dongsheng Li, Hansu Gu, Peng Zhang, Ning Gu, and Tun Lu

TL;DR
This paper presents a multimodal, multi-agent recommendation filtering system that reduces false positives, improves transparency, and enhances user control in personalized content feeds.
Contribution
It introduces a novel end-to-cloud, multimodal, multi-agent framework with a fact-grounded adjudication pipeline and dynamic preference graph for improved filtering.
Findings
Decreased false positive rate by 74.3%
Nearly doubled F1-Score over text-only baselines
Enhanced user control and transparency in recommendations
Abstract
While personalized recommender systems excel at content discovery, they frequently expose users to undesirable or discomforting information, highlighting the critical need for user-centric filtering tools. Current methods leveraging Large Language Models (LLMs) struggle with two major bottlenecks: they lack multimodal awareness to identify visually inappropriate content, and they are highly prone to "over-association" -- incorrectly generalizing a user's specific dislike (e.g., anxiety-inducing marketing) to block benign, educational materials. These unconstrained hallucinations lead to a high volume of false positives, ultimately undermining user agency. To overcome these challenges, we introduce a novel framework that integrates end-to-cloud collaboration, multimodal perception, and multi-agent orchestration. Our system employs a fact-grounded adjudication pipeline to eliminate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
