TL;DR
This paper presents a theoretical analysis and a practical framework, TF-LLMER, to improve optimization and performance of LLM-enhanced recommender systems by addressing key representation issues.
Contribution
It introduces TF-LLMER, a lightweight framework with normalization and Rec-PCA, to enhance training stability and semantic alignment in LLM-enhanced recommenders.
Findings
TF-LLMER outperforms state-of-the-art methods in experiments.
Normalization improves optimization stability.
Rec-PCA enhances semantic and collaborative structure alignment.
Abstract
Large language model (LLM)-enhanced recommendation models inject LLM representations into backbone recommenders to exploit rich item text without inference-time LLM cost. However, we find that existing LLM-enhanced methods significantly hinder the optimization of backbone models, resulting in high training losses that are difficult to reduce. To address it, we establish a comprehensive theoretical analysis of local optimization curvature and identify two key causes: 1) large norm disparity and 2) semantic-collaboration misaligned angular clustering of LLM representations. Guided by these insights, we propose Training-Friendly LLM-Enhanced Recommender (TF-LLMER), a lightweight framework with two key components. First, we highlight the necessity of item embedding normalization to eliminate norm-driven instability and achieve provable control over optimization conditioning. Second, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
