Hidden Markov Chains, Entropic Forward-Backward, and Part-Of-Speech Tagging
Elie Azeraf, Emmanuel Monfrini, Emmanuel Vignon, Wojciech Pieczynski

TL;DR
This paper introduces a novel Entropic Forward-Backward algorithm for Hidden Markov Chains, enabling the incorporation of arbitrary features in NLP tasks like Part-Of-Speech tagging, and demonstrates its superiority over MEMMs.
Contribution
It presents a new computation method for HMCs using Entropic Forward-Backward probabilities, allowing feature integration similar to MEMMs.
Findings
HMC with EFB outperforms MEMM in POS tagging.
The method enables arbitrary feature use in HMC models.
Potential to replace RNNs with HMCs using EFB in deep architectures.
Abstract
The ability to take into account the characteristics - also called features - of observations is essential in Natural Language Processing (NLP) problems. Hidden Markov Chain (HMC) model associated with classic Forward-Backward probabilities cannot handle arbitrary features like prefixes or suffixes of any size, except with an independence condition. For twenty years, this default has encouraged the development of other sequential models, starting with the Maximum Entropy Markov Model (MEMM), which elegantly integrates arbitrary features. More generally, it led to neglect HMC for NLP. In this paper, we show that the problem is not due to HMC itself, but to the way its restoration algorithms are computed. We present a new way of computing HMC based restorations using original Entropic Forward and Entropic Backward (EFB) probabilities. Our method allows taking into account features in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis
