Demystifying Embedding Spaces using Large Language Models
Guy Tennenholtz, Yinlam Chow, Chih-Wei Hsu, Jihwan Jeong, Lior Shani,, Azamat Tulepbergenov, Deepak Ramachandran, Martin Mladenov, Craig Boutilier

TL;DR
This paper introduces a method using Large Language Models to interpret and explore complex embedding spaces directly, making them more understandable and useful across various tasks.
Contribution
It presents a novel approach of injecting embeddings into LLMs to enable direct querying and interpretation, enhancing the utility of embeddings in different applications.
Findings
Enables direct interaction with embeddings via LLMs
Improves interpretability of concept activation vectors
Decodes user preferences in recommender systems
Abstract
Embeddings have become a pivotal means to represent complex, multi-faceted information about entities, concepts, and relationships in a condensed and useful format. Nevertheless, they often preclude direct interpretation. While downstream tasks make use of these compressed representations, meaningful interpretation usually requires visualization using dimensionality reduction or specialized machine learning interpretability methods. This paper addresses the challenge of making such embeddings more interpretable and broadly useful, by employing Large Language Models (LLMs) to directly interact with embeddings -- transforming abstract vectors into understandable narratives. By injecting embeddings into LLMs, we enable querying and exploration of complex embedding data. We demonstrate our approach on a variety of diverse tasks, including: enhancing concept activation vectors (CAVs),…
Peer Reviews
Decision·ICLR 2024 poster
1. It is sound to demystify domain embedding with Large Language Models. 2. The authors introduce their approach step-by-step in Sec 2 & 3, which is clear. 3. The performance is evaluated by human raters.
1. The training data is generated by PaML 2-L, and ELM uses PaLM 2-XS to interpret the domain embeddings. Their similar architectures and training procedures may make the contribution limited. 2. The authors argue that ELM is a general framework for different tasks, but the experiments only involve one model (MF) on one dataset (MovieLens). Therefore, the generalization of ELM to other models and datasets is not guaranteed. 3. Some related works are overlooked. Existing works [1][2] have shown t
- [S1] The proposed method ELM is novel and intuitive. The two-stage training (adapter first, full model next) is interesting and makes sense. - [S2] I appreciate including human evaluation in section 3. The analysis and discussion on hypothetical embedding vectors and concept activation vectors (section 4) are very interesting. - [S3] The paper overall is well-written and easy to follow.
## Major weaknesses - [W1] The evaluation can be improved. (1) The training data are all synthesized from an LLM. (2) The 24 tasks are all original. (3) ELM is only evaluated on the MovieLens. (4) There is no baseline method. It is unclear how well ELM will perform on real data and tasks compared to other methods. ## Minor weaknesses - [M1] It would be great to explain $E_A$ in Figure 2 (the definition is on page 4). - [M2] The 24 tasks are essential to understand this paper. I recommend addi
- Interpreting abstract embeddings into human-understandable natural language descriptions is intuitively appealing. - The proposed approach is simple and effective by reasonably leveraging the power of large language models (LLMs). - The authors have comprehensively assessed the quality of ELM's outputs using a variety of evaluation techniques, including qualitative human evaluations and specific consistency metrics.
Honestly, one of my greatest concerns is the practicality of the proposed framework, given that training an ELM requires manually constructing a batch of tasks, which need to be diverse enough to extract rich semantic information from the target domain $\mathcal{W}$ to support the interpretation of the embeddings. Admittedly, the experimental part of the paper validates ELM's proficiency in interpreting two forms of embeddings on a movie-related dataset. However, I am not sure if ELM performs eq
Videos
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Explainable Artificial Intelligence (XAI)
