A New Unbiased and Efficient Class of LSH-Based Samplers and Estimators   for Partition Function Computation in Log-Linear Models

Ryan Spring; Anshumali Shrivastava

arXiv:1703.05160·stat.ML·March 16, 2017·24 cites

A New Unbiased and Efficient Class of LSH-Based Samplers and Estimators for Partition Function Computation in Log-Linear Models

Ryan Spring, Anshumali Shrivastava

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel LSH-based sampling method and unbiased estimator that significantly improves the efficiency and accuracy of partition function computation in large-scale log-linear models, enabling faster training of language models.

Contribution

The paper presents a new LSH-based sampling scheme and unbiased estimator that achieve sub-linear time partition function estimation, outperforming existing importance sampling and Gumbel-Max methods.

Findings

01

Accurate partition function estimation in sub-linear time.

02

Effective training of language models with only 1-2% of usual computations.

03

Outperforms state-of-the-art sampling techniques in speed and accuracy.

Abstract

Log-linear models are arguably the most successful class of graphical models for large-scale applications because of their simplicity and tractability. Learning and inference with these models require calculating the partition function, which is a major bottleneck and intractable for large state spaces. Importance Sampling (IS) and MCMC-based approaches are lucrative. However, the condition of having a "good" proposal distribution is often not satisfied in practice. In this paper, we add a new dimension to efficient estimation via sampling. We propose a new sampling scheme and an unbiased estimator that estimates the partition function accurately in sub-linear time. Our samples are generated in near-constant time using locality sensitive hashing (LSH), and so are correlated and unnormalized. We demonstrate the effectiveness of our proposed approach by comparing the accuracy and speed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rdspring1/lsh_deeplearning
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Algorithms and Data Compression · Domain Adaptation and Few-Shot Learning

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings