Hard vs. Noise: Resolving Hard-Noisy Sample Confusion in Recommender Systems via Large Language Models

Tianrui Song; Wen-Shuo Chao; Hao Liu

arXiv:2511.07295·cs.IR·November 12, 2025

Hard vs. Noise: Resolving Hard-Noisy Sample Confusion in Recommender Systems via Large Language Models

Tianrui Song, Wen-Shuo Chao, Hao Liu

PDF

Open Access 1 Video

TL;DR

This paper introduces LLMHNI, a framework that uses large language models to distinguish hard samples from noisy ones in recommender systems, improving denoising and recommendation accuracy.

Contribution

The paper proposes a novel LLM-based approach to differentiate hard and noisy samples, addressing the hard-noisy confusion problem in implicit feedback data.

Findings

01

LLMHNI effectively improves denoising performance.

02

The framework enhances recommendation accuracy.

03

It successfully differentiates hard samples from noisy ones.

Abstract

Implicit feedback, employed in training recommender systems, unavoidably confronts noise due to factors such as misclicks and position bias. Previous studies have attempted to identify noisy samples through their diverged data patterns, such as higher loss values, and mitigate their influence through sample dropping or reweighting. However, we observed that noisy samples and hard samples display similar patterns, leading to hard-noisy confusion issue. Such confusion is problematic as hard samples are vital for modeling user preferences. To solve this problem, we propose LLMHNI framework, leveraging two auxiliary user-item relevance signals generated by Large Language Models (LLMs) to differentiate hard and noisy samples. LLMHNI obtains user-item semantic relevance from LLM-encoded embeddings, which is used in negative sampling to select hard negatives while filtering out noisy false…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Hard vs. Noise: Resolving Hard-Noisy Sample Confusion in Recommender Systems via Large Language Models· underline

Taxonomy

TopicsRecommender Systems and Techniques · Advanced Graph Neural Networks · Topic Modeling