Efficient Computation of Expectations under Spanning Tree Distributions
Ran Zmigrod, Tim Vieira, Ryan Cotterell

TL;DR
This paper introduces a unified, efficient framework for computing expectations and gradients in spanning tree models, significantly improving runtime performance and simplifying implementation.
Contribution
The authors develop a general framework that unifies and accelerates the computation of expectations and gradients in spanning tree models, including new algorithms for previously unaddressed quantities.
Findings
Algorithms are up to 15 times faster for entropy computation.
Framework reduces runtime complexity by a factor of sentence length.
Validated through runtime experiments showing significant speedups.
Abstract
We give a general framework for inference in spanning tree models. We propose unified algorithms for the important cases of first-order expectations and second-order expectations in edge-factored, non-projective spanning-tree models. Our algorithms exploit a fundamental connection between gradients and expectations, which allows us to derive efficient algorithms. These algorithms are easy to implement with or without automatic differentiation software. We motivate the development of our framework with several \emph{cautionary tales} of previous research, which has developed numerous inefficient algorithms for computing expectations and their gradients. We demonstrate how our framework efficiently computes several quantities with known algorithms, including the expected attachment score, entropy, and generalized expectation criteria. As a bonus, we give algorithms for quantities that are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
