A constrained recursion algorithm for batch normalization of tree-sturctured LSTM
Ruo Ando, Yoshiyasu Takefuji

TL;DR
This paper introduces a constrained recursive algorithm for batch normalization in tree-structured LSTM networks, enabling better hyperparameter control and improved training efficiency.
Contribution
It proposes a novel recursion method with constraints for traversing and normalizing tree-structured LSTM, facilitating hyperparameter tuning.
Findings
Effective hyperparameter tuning for batch normalization in tree-LSTM.
Improved training efficiency and reduced validation loss.
Controlled traversal enabling optimized model performance.
Abstract
Tree-structured LSTM is promising way to consider long-distance interaction over hierarchies. However, there have been few research efforts on the hyperparameter tuning of the construction and traversal of tree-structured LSTM. To name a few, hyperparamters such as the interval of state initialization, the number of batches for normalization have been left unexplored specifically in applying batch normalization for reducing training cost and parallelization. In this paper, we propose a novel recursive algorithm for traversing batch normalized tree-structured LSTM. In proposal method, we impose the constraint on the recursion algorithm for the depth-first search of binary tree representation of LSTM for which batch normalization is applied. With our constrained recursion, we can control the hyperparameter in the traversal of several tree-structured LSTMs which is generated in the process…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech Recognition and Synthesis · Topic Modeling
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Batch Normalization
