Inference by Minimizing Size, Divergence, or their Sum
Sebastian Riedel, David A. Smith, Andrew McCallum

TL;DR
This paper introduces methods to speed up marginal inference by selectively ignoring less impactful factors, achieving significant computational savings with minimal accuracy loss across various models.
Contribution
It proposes three novel schemes for factor selection based on divergence and size constraints, with an efficient approximation of KL divergence for faster inference.
Findings
Up to 11x faster marginal inference compared to loopy BP
Graph sizes reduced by up to 98% with comparable accuracy
Weighted sum minimization is faster than other objectives
Abstract
We speed up marginal inference by ignoring factors that do not significantly contribute to overall accuracy. In order to pick a suitable subset of factors to ignore, we propose three schemes: minimizing the number of model factors under a bound on the KL divergence between pruned and full models; minimizing the KL divergence under a bound on factor count; and minimizing the weighted sum of KL divergence and factor count. All three problems are solved using an approximation of the KL divergence than can be calculated in terms of marginals computed on a simple seed graph. Applied to synthetic image denoising and to three different types of NLP parsing models, this technique performs marginal inference up to 11 times faster than loopy BP, with graph sizes reduced up to 98%-at comparable error in marginals and parsing accuracy. We also show that minimizing the weighted sum of divergence and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Domain Adaptation and Few-Shot Learning
