A linear memory algorithm for Baum-Welch training
Istvan Miklos, Irmtraud M. Meyer

TL;DR
This paper introduces a linear memory algorithm for Baum-Welch training of hidden Markov models, significantly reducing memory usage while maintaining computational efficiency, especially for large-scale biological data.
Contribution
The authors present a novel linear space algorithm for Baum-Welch training that reduces memory requirements to be independent of sequence length, outperforming previous checkpointing methods.
Findings
Requires O(M) memory, independent of sequence length.
Reduces memory from O(log(L) M) to O(M) for large sequences.
Faster and more memory-efficient for gene prediction HMMs.
Abstract
Background: Baum-Welch training is an expectation-maximisation algorithm for training the emission and transition probabilities of hidden Markov models in a fully automated way. Methods and results: We introduce a linear space algorithm for Baum-Welch training. For a hidden Markov model with M states, T free transition and E free emission parameters, and an input sequence of length L, our new algorithm requires O(M) memory and O(L M T_max (T + E)) time for one Baum-Welch iteration, where T_max is the maximum number of states that any state is connected to. The most memory efficient algorithm until now was the checkpointing algorithm with O(log(L) M) memory and O(log(L) L M T_max) time requirement. Our novel algorithm thus renders the memory requirement completely independent of the length of the training sequences. More generally, for an n-hidden Markov model and n input sequences of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Evolutionary Algorithms and Applications · Metaheuristic Optimization Algorithms Research
