Matryoshka-Adaptor: Unsupervised and Supervised Tuning for Smaller   Embedding Dimensions

Jinsung Yoon; Raj Sinha; Sercan O Arik; Tomas Pfister

arXiv:2407.20243·cs.CL·July 31, 2024

Matryoshka-Adaptor: Unsupervised and Supervised Tuning for Smaller Embedding Dimensions

Jinsung Yoon, Raj Sinha, Sercan O Arik, Tomas Pfister

PDF

Open Access 1 Video

TL;DR

Matryoshka-Adaptor is a versatile tuning framework that reduces embedding dimensions from large language models, significantly improving efficiency while maintaining performance across diverse datasets and API access levels.

Contribution

It introduces a novel method for substantial dimensionality reduction of LLM embeddings that is compatible with various architectures and both learning paradigms.

Findings

01

Achieves 2-12 fold dimensionality reduction without performance loss.

02

Effective across English, multilingual, and multimodal datasets.

03

Compatible with black-box API embeddings.

Abstract

Embeddings from Large Language Models (LLMs) have emerged as critical components in various applications, particularly for information retrieval. While high-dimensional embeddings generally demonstrate superior performance as they contain more salient information, their practical application is frequently hindered by elevated computational latency and the associated higher cost. To address these challenges, we propose Matryoshka-Adaptor, a novel tuning framework designed for the customization of LLM embeddings. Matryoshka-Adaptor facilitates substantial dimensionality reduction while maintaining comparable performance levels, thereby achieving a significant enhancement in computational efficiency and cost-effectiveness. Our framework directly modifies the embeddings from pre-trained LLMs which is designed to be seamlessly integrated with any LLM architecture, encompassing those…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Matryoshka-Adaptor: Unsupervised and Supervised Tuning for Smaller Embedding Dimensions· underline

Taxonomy

TopicsNeural Networks and Applications