AKRMap: Adaptive Kernel Regression for Trustworthy Visualization of Cross-Modal Embeddings

Yilin Ye; Junchao Huang; Xingchen Zeng; Jiazhi Xia; Wei Zeng

arXiv:2505.14664·cs.CV·May 29, 2025

AKRMap: Adaptive Kernel Regression for Trustworthy Visualization of Cross-Modal Embeddings

Yilin Ye, Junchao Huang, Xingchen Zeng, Jiazhi Xia, Wei Zeng

PDF

Open Access 1 Repo 1 Video

TL;DR

AKRMap is a novel dimensionality reduction technique that visualizes cross-modal embeddings more accurately by learning kernel regression of metric landscapes, improving interpretability of multi-modal models.

Contribution

Introduces AKRMap, a supervised projection method with adaptive kernels that better captures cross-modal metric distributions for visualization.

Findings

01

Outperforms existing DR methods in accuracy and trustworthiness

02

Supports interactive visualization features like zoom and overlay

03

Effectively visualizes cross-modal embeddings for text-to-image models

Abstract

Cross-modal embeddings form the foundation for multi-modal models. However, visualization methods for interpreting cross-modal embeddings have been primarily confined to traditional dimensionality reduction (DR) techniques like PCA and t-SNE. These DR methods primarily focus on feature distributions within a single modality, whilst failing to incorporate metrics (e.g., CLIPScore) across multiple modalities. This paper introduces AKRMap, a new DR technique designed to visualize cross-modal embeddings metric with enhanced accuracy by learning kernel regression of the metric landscape in the projection space. Specifically, AKRMap constructs a supervised projection network guided by a post-projection kernel regression loss, and employs adaptive generalized kernels that can be jointly optimized with the projection. This approach enables AKRMap to efficiently generate visualizations that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yilinye/akrmap
pytorchOfficial

Videos

AKRMap: Adaptive Kernel Regression for Trustworthy Visualization of Cross-Modal Embeddings· slideslive

Taxonomy

TopicsExplainable Artificial Intelligence (XAI)

MethodsFocus · Principal Components Analysis