FilterLLM: Text-To-Distribution LLM for Billion-Scale Cold-Start Recommendation
Ruochen Liu, Hao Chen, Yuanchen Bei, Zheyu Zhou, Lijia Chen, Qijie, Shen, Feiran Huang, Fakhri Karray, Senzhang Wang

TL;DR
FilterLLM introduces a novel Text-to-Distribution paradigm enabling large language models to efficiently perform billion-scale cold-start recommendations by predicting item interaction probabilities for all users in a single inference.
Contribution
The paper proposes a new paradigm and framework, FilterLLM, that extends LLM capabilities to billion-scale cold-start recommendation tasks with improved efficiency and effectiveness.
Findings
Achieves over 30 times higher efficiency than state-of-the-art methods.
Successfully deployed on Alibaba platform for two months, serving over one billion cold items.
Online A/B test confirms significant performance improvements.
Abstract
Large Language Model (LLM)-based cold-start recommendation systems continue to face significant computational challenges in billion-scale scenarios, as they follow a "Text-to-Judgment" paradigm. This approach processes user-item content pairs as input and evaluates each pair iteratively. To maintain efficiency, existing methods rely on pre-filtering a small candidate pool of user-item pairs. However, this severely limits the inferential capabilities of LLMs by reducing their scope to only a few hundred pre-filtered candidates. To overcome this limitation, we propose a novel "Text-to-Distribution" paradigm, which predicts an item's interaction probability distribution for the entire user set in a single inference. Specifically, we present FilterLLM, a framework that extends the next-word prediction capabilities of LLMs to billion-scale filtering tasks. FilterLLM first introduces a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Advanced Text Analysis Techniques · Text and Document Classification Technologies
MethodsSparse Evolutionary Training
