Selected Operations, Algorithms, and Applications of n-Tape Weighted Finite-State Machines
Andr\'e Kempe

TL;DR
This paper explores the operations, algorithms, and diverse applications of n-tape weighted finite-state machines (n-WFSMs), highlighting their enhanced capabilities over classical models in linguistic and computational tasks.
Contribution
It introduces the theoretical foundations of n-WFSMs, discusses restricted algorithms for their operations, and demonstrates their practical advantages in complex language processing applications.
Findings
n-WFSMs can model complex linguistic relations beyond classical transducers.
Restricted algorithms enable certain operations despite theoretical limitations.
Applications include morphological analysis, lexicon alignment, and acronym extraction.
Abstract
A weighted finite-state machine with n tapes (n-WFSM) defines a rational relation on n strings. It is a generalization of weighted acceptors (one tape) and transducers (two tapes). After recalling some basic definitions about n-ary weighted rational relations and n-WFSMs, we summarize some central operations on these relations and machines, such as join and auto-intersection. Unfortunately, due to Post's Correspondence Problem, a fully general join or auto-intersection algorithm cannot exist. We recall a restricted algorithm for a class of n-WFSMs. Through a series of practical applications, we finally investigate the augmented descriptive power of n-WFSMs and their join, compared to classical transducers and their composition. Some applications are not feasible with the latter. The series includes: the morphological analysis of Semitic languages, the preservation of intermediate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · semigroups and automata theory · Algorithms and Data Compression
