# Binary Stochastic Representations for Large Multi-class Classification

**Authors:** Thomas Gerald, Aur\'elia L\'eon, Nicolas Baskiotis, Ludovic Denoyer

arXiv: 1906.09838 · 2019-06-25

## TL;DR

This paper introduces Deep Stochastic Neural Codes (DSNC), an end-to-end model for large multi-class classification that learns to assign binary codes to categories and map inputs to codes, maintaining sublinear inference complexity without prior tuning.

## Contribution

The proposed DSNC model jointly learns binary code assignment and input-to-code mapping in an end-to-end manner, eliminating the need for heuristic code-category association.

## Key findings

- Effective on multiple datasets compared to baselines.
- Maintains sublinear inference complexity.
- No need for pre-defined code-category mappings.

## Abstract

Classification with a large number of classes is a key problem in machine learning and corresponds to many real-world applications like tagging of images or textual documents in social networks. If one-vs-all methods usually reach top performance in this context, these approaches suffer from a high inference complexity, linear w.r.t the number of categories. Different models based on the notion of binary codes have been proposed to overcome this limitation, achieving in a sublinear inference complexity. But they a priori need to decide which binary code to associate to which category before learning using more or less complex heuristics. We propose a new end-to-end model which aims at simultaneously learning to associate binary codes with categories, but also learning to map inputs to binary codes. This approach called Deep Stochastic Neural Codes (DSNC) keeps the sublinear inference complexity but do not need any a priori tuning. Experimental results on different datasets show the effectiveness of the approach w.r.t baseline methods.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.09838/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/1906.09838/full.md

## References

22 references — full list in the complete paper: https://tomesphere.com/paper/1906.09838/full.md

---
Source: https://tomesphere.com/paper/1906.09838