TL;DR
This paper extends the backward algorithm for weighted finite-state automata to efficiently handle failure transitions directly, enabling more compact representations in NLP models without preprocessing overhead.
Contribution
It introduces a novel algorithm for acyclic WFSAs with failure arcs that operates efficiently without eliminating failure transitions, improving computational performance in NLP applications.
Findings
The extended algorithm runs in near-linear time relative to the number of transitions.
Efficiency is achieved when the average outgoing arcs per state are small.
Special cases like CRFs and ring-weighted automata further improve performance.
Abstract
Weighted finite-state automata (WSFAs) are commonly used in NLP. Failure transitions are a useful extension for compactly representing backoffs or interpolation in -gram models and CRFs, which are special cases of WFSAs. The pathsum in ordinary acyclic WFSAs is efficiently computed by the backward algorithm in time , where is the set of transitions. However, this does not allow failure transitions, and preprocessing the WFSA to eliminate failure transitions could greatly increase . We extend the backward algorithm to handle failure transitions directly. Our approach is efficient when the average state has outgoing arcs for only a small fraction of the alphabet . We propose an algorithm for general acyclic WFSAs which runs in , where is the set of states and is the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
