Predicting integers from continuous parameters
Bas Maat, Peter Bloem

TL;DR
This paper explores modeling integer-valued predictions directly with discrete distributions in neural networks, comparing various options across different tasks.
Contribution
It introduces and evaluates novel and existing discrete distributions for neural network output layers to improve integer prediction accuracy.
Findings
Bitwise distribution performs well across tasks.
Discrete Laplace distribution effectively models integer targets.
Direct discrete modeling outperforms continuous regression in some cases.
Abstract
We study the problem of predicting numeric labels that are constrained to the integers or to a subrange of the integers. For example, the number of up-votes on social media posts, or the number of bicycles available at a public rental station. While it is possible to model these as continuous values, and to apply traditional regression, this approach changes the underlying distribution on the labels from discrete to continuous. Discrete distributions have certain benefits, which leads us to the question whether such integer labels can be modeled directly by a discrete distribution, whose parameters are predicted from the features of a given instance. Moreover, we focus on the use case of output distributions of neural networks, which adds the requirement that the parameters of the distribution be continuous so that backpropagation and gradient descent may be used to learn the weights of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
