Algorithms for Acyclic Weighted Finite-State Automata with Failure Arcs

Anej Svete; Benjamin Dayan; Tim Vieira; Ryan Cotterell; Jason Eisner

arXiv:2301.06862·cs.DS·July 12, 2023

TL;DR

This paper extends the backward algorithm for weighted finite-state automata to efficiently handle failure transitions directly, enabling more compact representations in NLP models without preprocessing overhead.

Contribution

It introduces a novel algorithm for acyclic WFSAs with failure arcs that operates efficiently without eliminating failure transitions, improving computational performance in NLP applications.

Findings

01 The extended backward algorithm handles failure transitions directly, so the WFSA never has to be expanded into a failure-free one with a potentially much larger transition set.

02 The approach is efficient when the average state has outgoing arcs for only a small fraction of the alphabet.

03 Special cases such as CRFs and ring-weighted automata admit further speedups.

Abstract

Weighted finite-state automata (WFSAs) are commonly used in NLP. Failure transitions are a useful extension for compactly representing backoffs or interpolation in $n$-gram models and CRFs, which are special cases of WFSAs. The pathsum in ordinary acyclic WFSAs is efficiently computed by the backward algorithm in time $O(|E|)$, where $E$ is the set of transitions. However, this does not allow failure transitions, and preprocessing the WFSA to eliminate failure transitions could greatly increase $|E|$. We extend the backward algorithm to handle failure transitions directly. Our approach is efficient when the average state has outgoing arcs for only a small fraction $s \ll 1$ of the alphabet $\Sigma$. We propose an algorithm for general acyclic WFSAs which runs in $O(|E| + s|\Sigma||Q|T_{\max}\log|\Sigma|)$, where $Q$ is the set of states and $T_{\max}$ is the…
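To make the setting in the abstract concrete, here is a minimal Python sketch of the ordinary backward algorithm on an acyclic WFSA over the real semiring, next to a naive baseline that respects failure arcs. It is not the paper's algorithm and not the API of the official repository. The function names, the dictionary-based encoding, and the failure convention (a failure arc is followed only when the current state has no outgoing arc for the symbol being read) are assumptions made for illustration. The naive baseline loops over the whole alphabet at every state, which is the kind of cost a failure-aware algorithm aims to avoid.

```python
def backward(states, arcs, final):
    """Ordinary backward algorithm; touches each arc once, so O(|E|).

    states: list of states in topological order (hypothetical encoding).
    arcs:   dict state -> list of (symbol, target, weight).
    final:  dict state -> final weight (missing means 0).
    """
    beta = {}
    for q in reversed(states):
        total = final.get(q, 0.0)
        for _symbol, target, weight in arcs.get(q, []):
            total += weight * beta[target]
        beta[q] = total
    return beta


def backward_with_failure_naive(states, arcs, final, fail):
    """Naive failure-aware baseline (assumed convention, see lead-in).

    fail: dict state -> (fallback_state, weight). The fallback state must
    appear later than the state itself in the topological order.
    """
    alphabet = {symbol for out in arcs.values() for symbol, _, _ in out}
    beta = {}
    beta_sym = {}  # (state, symbol) -> total weight of paths that start by reading symbol
    for q in reversed(states):
        explicit = {}
        for symbol, target, weight in arcs.get(q, []):
            explicit[symbol] = explicit.get(symbol, 0.0) + weight * beta[target]
        total = final.get(q, 0.0)
        for a in alphabet:
            if a in explicit:            # an explicit arc blocks the failure arc
                value = explicit[a]
            elif q in fail:              # otherwise back off and retry the symbol
                fallback, weight = fail[q]
                value = weight * beta_sym[(fallback, a)]
            else:
                value = 0.0
            beta_sym[(q, a)] = value
            total += value
        beta[q] = total
    return beta


# Toy automaton: state 0 has an explicit "a" arc and backs off to state 1.
states = [0, 1, 2]
arcs = {0: [("a", 2, 2.0)], 1: [("a", 2, 3.0), ("b", 2, 4.0)]}
final = {2: 1.0}
fail = {0: (1, 0.5)}

beta_plain = backward(states, arcs, final)                         # ignores the failure arc
beta_fail = backward_with_failure_naive(states, arcs, final, fail)
print(beta_plain[0], beta_fail[0])  # 2.0 4.0
```

In the toy automaton, state 0 reads "a" through its own arc (weight 2.0) and reads "b" only by backing off to state 1 (0.5 × 4.0), so the failure-aware pathsum from state 0 is 4.0, while the plain backward pass, which cannot see the failure arc, reports 2.0.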

Code & Models

Repositories

rycolab/failure-backward (official)