
TL;DR
This paper introduces Viterbi training (VT) into PRISM, a logic-based probabilistic system, demonstrating faster convergence and improved parsing accuracy in probabilistic grammars, while simplifying implementation for users.
Contribution
The paper presents the integration of VT into PRISM, showing its advantages over EM in convergence speed, prediction performance, and ease of use for logic programmers.
Findings
VT in PRISM converges faster than EM.
VT achieves the best parsing accuracy in experiments.
VT does not require the PRISM exclusiveness condition.
Abstract
VT (Viterbi training), or hard EM, is an efficient way of parameter learning for probabilistic models with hidden variables. Given an observation , it searches for a state of hidden variables that maximizes by coordinate ascent on parameters and . In this paper we introduce VT to PRISM, a logic-based probabilistic modeling system for generative models. VT improves PRISM in three ways. First VT in PRISM converges faster than EM in PRISM due to the VT's termination condition. Second, parameters learned by VT often show good prediction performance compared to those learned by EM. We conducted two parsing experiments with probabilistic grammars while learning parameters by a variety of inference methods, i.e.\ VT, EM, MAP and VB. The result is that VT achieved the best parsing accuracy among them in both experiments. Also we conducted a similar…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
