# Zero-shot image classification based on class representation learning and attribute embedding learning

**Authors:** Huabo Shen, Xiaodong Sun, Youmin Hu, Changgeng Li, Qinmiao Zhu, Qin Li, Haofeng Zhang, Haofeng Zhang, Haofeng Zhang

PMC · DOI: 10.1371/journal.pone.0332797 · PLOS One · 2025-11-17

## TL;DR

This paper introduces CRAE, a new method for zero-shot image classification that improves accuracy by combining class and attribute learning.

## Contribution

CRAE introduces adaptive softmax and contrastive learning to enhance attribute and class representations in zero-shot learning.

## Key findings

- CRAE outperforms existing methods on benchmark datasets like CUB, SUN, and AWA2.
- Attribute-level contrastive learning with hard sample selection improves feature distinctiveness.
- Class-level contrastive learning enhances separation between different class features.

## Abstract

Zero-shot learning (ZSL) aims to classify unseen classes by leveraging semantic information from seen classes, addressing the challenge of limited labeled data. In recent years, ZSL methods have focused on extracting attribute-level features from images and aligning them with semantic features within an embedding space. However, existing approaches often fail to account for significant visual variations within the same attribute, leading to noisy attribute-level features that degrade classification performance.To tackle these challenges, we propose a novel zero-shot image classification method named CRAE (Class Representation and Attribute Embedding), which combines class representation learning and attribute embedding learning to enhance classification robustness and accuracy. Specifically, we design an adaptive softmax activation function to normalize attribute feature maps, effectively reducing noise and improving the discriminability of attribute-level features. Additionally, we introduce attribute-level contrastive learning with hard sample selection to optimize the attribute embedding space, reinforcing the distinctiveness of attribute representations. To further increase classification accuracy, we incorporate class-level contrastive learning to enhance the separation between features of different classes. We evaluate the effectiveness of our approach on three widely used benchmark datasets (CUB, SUN, and AWA2), and the experimental results demonstrate that CRAE significantly outperforms existing state-of-the-art methods, proving its superior capability in zero-shot image classification.

## Full-text entities

- **Diseases:** CUB (MESH:D001715), GZSL (MESH:D007859), HS (MESH:C567159), CRAE (MESH:D020969)
- **Chemicals:** CUB (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12622808/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12622808/full.md

## References

18 references — full list in the complete paper: https://tomesphere.com/paper/PMC12622808/full.md

---
Source: https://tomesphere.com/paper/PMC12622808