Do We Really Need Specialization? Evaluating Generalist Text Embeddings for Zero-Shot Recommendation and Search

Matteo Attimonelli; Alessandro De Bellis; Claudio Pomo; Dietmar Jannach; Eugenio Di Sciascio; Tommaso Di Noia

arXiv:2507.05006·cs.IR·July 9, 2025

Do We Really Need Specialization? Evaluating Generalist Text Embeddings for Zero-Shot Recommendation and Search

Matteo Attimonelli, Alessandro De Bellis, Claudio Pomo, Dietmar Jannach, Eugenio Di Sciascio, Tommaso Di Noia

PDF

TL;DR

This paper demonstrates that large-scale pre-trained generalist text embedding models can effectively perform zero-shot recommendation and search tasks without domain-specific fine-tuning, outperforming traditional and fine-tuned models.

Contribution

The study shows that generalist text embeddings pre-trained on large corpora can achieve strong zero-shot performance in recommendation and search, challenging the need for task-specific fine-tuning.

Findings

01

GTEs outperform fine-tuned models in zero-shot recommendation and search

02

Dimensionality reduction via PCA improves model performance

03

GTEs distribute features more evenly across embedding space

Abstract

Pre-trained language models (PLMs) are widely used to derive semantic representations from item metadata in recommendation and search. In sequential recommendation, PLMs enhance ID-based embeddings through textual metadata, while in product search, they align item characteristics with user intent. Recent studies suggest task and domain-specific fine-tuning are needed to improve representational power. This paper challenges this assumption, showing that Generalist Text Embedding Models (GTEs), pre-trained on large-scale corpora, can guarantee strong zero-shot performance without specialized adaptation. Our experiments demonstrate that GTEs outperform traditional and fine-tuned models in both sequential recommendation and product search. We attribute this to a superior representational power, as they distribute features more evenly across the embedding space. Finally, we show that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.