Transparent and Controllable Recommendation Filtering via Multimodal Multi-Agent Collaboration

Chi Zhang; Zhipeng Xu; Jiahao Liu; Dongsheng Li; Hansu Gu; Peng Zhang; Ning Gu; and Tun Lu

arXiv:2604.17459·cs.IR·April 21, 2026

Transparent and Controllable Recommendation Filtering via Multimodal Multi-Agent Collaboration

Chi Zhang, Zhipeng Xu, Jiahao Liu, Dongsheng Li, Hansu Gu, Peng Zhang, Ning Gu, and Tun Lu

PDF

TL;DR

This paper presents a multimodal, multi-agent recommendation filtering system that reduces false positives, improves transparency, and enhances user control in personalized content feeds.

Contribution

It introduces a novel end-to-cloud, multimodal, multi-agent framework with a fact-grounded adjudication pipeline and dynamic preference graph for improved filtering.

Findings

01

Decreased false positive rate by 74.3%

02

Nearly doubled F1-Score over text-only baselines

03

Enhanced user control and transparency in recommendations

Abstract

While personalized recommender systems excel at content discovery, they frequently expose users to undesirable or discomforting information, highlighting the critical need for user-centric filtering tools. Current methods leveraging Large Language Models (LLMs) struggle with two major bottlenecks: they lack multimodal awareness to identify visually inappropriate content, and they are highly prone to "over-association" -- incorrectly generalizing a user's specific dislike (e.g., anxiety-inducing marketing) to block benign, educational materials. These unconstrained hallucinations lead to a high volume of false positives, ultimately undermining user agency. To overcome these challenges, we introduce a novel framework that integrates end-to-cloud collaboration, multimodal perception, and multi-agent orchestration. Our system employs a fact-grounded adjudication pipeline to eliminate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.