Beyond One-hot Encoding: lower dimensional target embedding
Pau Rodr\'iguez, Miguel A. Bautista, Jordi Gonz\`alez, Sergio Escalera

TL;DR
This paper introduces a low-dimensional target embedding method for CNN training that leverages label relationships, significantly speeding up convergence and maintaining high accuracy compared to traditional one-hot encoding.
Contribution
It proposes using random projections and a normalized eigenrepresentation to embed labels into a low-dimensional space, enhancing training efficiency and accuracy.
Findings
Drastically improved convergence speed on multiple datasets.
Achieved competitive accuracy with low-dimensional embeddings.
Validated the effectiveness of label manifold embeddings in CNN training.
Abstract
Target encoding plays a central role when learning Convolutional Neural Networks. In this realm, One-hot encoding is the most prevalent strategy due to its simplicity. However, this so widespread encoding schema assumes a flat label space, thus ignoring rich relationships existing among labels that can be exploited during training. In large-scale datasets, data does not span the full label space, but instead lies in a low-dimensional output manifold. Following this observation, we embed the targets into a low-dimensional space, drastically improving convergence speed while preserving accuracy. Our contribution is two fold: (i) We show that random projections of the label space are a valid tool to find such lower dimensional embeddings, boosting dramatically convergence rates at zero computational cost; and (ii) we propose a normalized eigenrepresentation of the class manifold that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
