Zero-Shot Learning via Latent Space Encoding
Yunlong Yu, Zhong Ji, Jichang Guo, and Zhongfei (Mark) Zhang

TL;DR
This paper introduces Latent Space Encoding (LSE), a novel approach for Zero-Shot Learning that models semantic relations across modalities via a learned latent space, improving transferability and extensibility.
Contribution
LSE is a new encoder-decoder framework that implicitly learns a shared latent space for multiple modalities without requiring explicit projection functions.
Findings
Outperforms existing methods on four benchmark datasets.
Effective in traditional, generalized ZSL, and zero-shot retrieval tasks.
Easily extendable to more modalities.
Abstract
Zero-Shot Learning (ZSL) is typically achieved by resorting to a class semantic embedding space to transfer the knowledge from the seen classes to unseen ones. Capturing the common semantic characteristics between the visual modality and the class semantic modality (e.g., attributes or word vector) is a key to the success of ZSL. In this paper, we propose a novel encoder-decoder approach, namely Latent Space Encoding (LSE), to connect the semantic relations of different modalities. Instead of requiring a projection function to transfer information across different modalities like most previous work, LSE per- forms the interactions of different modalities via a feature aware latent space, which is learned in an implicit way. Specifically, different modalities are modeled separately but optimized jointly. For each modality, an encoder-decoder framework is performed to learn a feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Cancer-related molecular mechanisms research
