Pseudo Label NCF for Sparse OHC Recommendation: Dual Representation Learning and the Separability Accuracy Trade off

Pronob Kumar Barman; Tera L. Reynolds; James Foulds

arXiv:2603.24750·cs.IR·April 1, 2026

Pseudo Label NCF for Sparse OHC Recommendation: Dual Representation Learning and the Separability Accuracy Trade off

Pronob Kumar Barman, Tera L. Reynolds, James Foulds

PDF

TL;DR

This paper introduces Pseudo Label NCF, a method that enhances sparse recommendation systems in online health communities by using survey-based pseudo labels to improve ranking and interpretability.

Contribution

It extends neural collaborative filtering models with pseudo label objectives, creating dual embeddings that balance ranking accuracy and interpretability in sparse data scenarios.

Findings

01

Pseudo labels improve ranking metrics significantly.

02

Pseudo label embeddings have higher semantic separability.

03

A trade-off exists between embedding interpretability and ranking performance.

Abstract

Online Health Communities connect patients for peer support, but users face a discovery challenge when they have minimal prior interactions to guide personalization. We study recommendation under extreme interaction sparsity in a survey driven setting where each user provides a 16 dimensional intake vector and each support group has a structured feature profile. We extend Neural Collaborative Filtering architectures, including Matrix Factorization, Multi Layer Perceptron, and NeuMF, with an auxiliary pseudo label objective derived from survey group feature alignment using cosine similarity mapped to [0, 1]. The resulting Pseudo Label NCF learns dual embedding spaces: main embeddings for ranking and pseudo label embeddings for semantic alignment. We evaluate on a dataset of 165 users and 498 support groups using a leave one out protocol that reflects cold start conditions. All pseudo…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.