Whitening Not Recommended for Classification Tasks in LLMs

Ali Forooghi; Shaghayegh Sadeghi; Jianguo Lu

arXiv:2407.12886·cs.CL·July 19, 2024

Whitening Not Recommended for Classification Tasks in LLMs

Ali Forooghi, Shaghayegh Sadeghi, Jianguo Lu

PDF

Open Access

TL;DR

This paper investigates the effects of whitening operations on sentence embeddings from large language models, revealing that whitening can harm classification performance and is not universally beneficial.

Contribution

The study provides a comprehensive analysis showing whitening's negative impact on classification tasks in LLM embeddings and introduces SentEval+ for embedding evaluation.

Findings

01

Whitening can degenerate embeddings for classification tasks.

02

Effectiveness of whitening is model- and task-dependent.

03

Introduces SentEval+ platform for embedding evaluation.

Abstract

Sentence embedding is a cornerstone in NLP. Whitening has been claimed to be an effective operation to improve embedding quality obtained from Large Language Models (LLMs). However, we find that the efficacy of whitening is model-dependent and task-dependent. In particular, whitening degenerates embeddings for classification tasks. The conclusion is supported by extensive experiments. We also explored a variety of whitening operations, including PCA, ZCA, PCA-Cor, ZCA-Cor and Cholesky whitenings. A by-product of our research is embedding evaluation platform for LLMs called SentEval+.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Law

MethodsPrincipal Components Analysis