Viterbi training in PRISM

Taisuke Sato; Keiichi Kubota

arXiv:1303.5659·cs.AI·February 19, 2020

Viterbi training in PRISM

Taisuke Sato, Keiichi Kubota

PDF

TL;DR

This paper introduces Viterbi training (VT) into PRISM, a logic-based probabilistic system, demonstrating faster convergence and improved parsing accuracy in probabilistic grammars, while simplifying implementation for users.

Contribution

The paper presents the integration of VT into PRISM, showing its advantages over EM in convergence speed, prediction performance, and ease of use for logic programmers.

Findings

01

VT in PRISM converges faster than EM.

02

VT achieves the best parsing accuracy in experiments.

03

VT does not require the PRISM exclusiveness condition.

Abstract

VT (Viterbi training), or hard EM, is an efficient way of parameter learning for probabilistic models with hidden variables. Given an observation $y$ , it searches for a state of hidden variables $x$ that maximizes $p (x, y ∣ θ)$ by coordinate ascent on parameters $θ$ and $x$ . In this paper we introduce VT to PRISM, a logic-based probabilistic modeling system for generative models. VT improves PRISM in three ways. First VT in PRISM converges faster than EM in PRISM due to the VT's termination condition. Second, parameters learned by VT often show good prediction performance compared to those learned by EM. We conducted two parsing experiments with probabilistic grammars while learning parameters by a variety of inference methods, i.e.\ VT, EM, MAP and VB. The result is that VT achieved the best parsing accuracy among them in both experiments. Also we conducted a similar…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.