Ensemble Methods for Sequence Classification with Hidden Markov Models
Maxime Kawawa-Beaudan, Srijan Sood, Soham Palande, Ganapathy Mani,, Tucker Balch, Manuela Veloso

TL;DR
This paper introduces an ensemble approach for Hidden Markov Models to improve sequence classification, especially in imbalanced and small datasets, demonstrating robustness and efficiency over neural networks.
Contribution
It proposes a novel ensemble training method for HMMs that generalizes to multi-class problems and enhances performance in imbalanced scenarios.
Findings
High average precision and AUC achieved
Outperforms neural networks in data-scarce environments
Effective in domains like finance and biology
Abstract
We present a lightweight approach to sequence classification using Ensemble Methods for Hidden Markov Models (HMMs). HMMs offer significant advantages in scenarios with imbalanced or smaller datasets due to their simplicity, interpretability, and efficiency. These models are particularly effective in domains such as finance and biology, where traditional methods struggle with high feature dimensionality and varied sequence lengths. Our ensemble-based scoring method enables the comparison of sequences of any length and improves performance on imbalanced datasets. This study focuses on the binary classification problem, particularly in scenarios with data imbalance, where the negative class is the majority (e.g., normal data) and the positive class is the minority (e.g., anomalous data), often with extreme distribution skews. We propose a novel training approach for HMM Ensembles that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
