Multiway Multislice PHATE: Visualizing Hidden Dynamics of RNNs through Training
Jiancheng Xie, Lou C. Kohler Voinov, Noga Mudrik, Gal Mishne, Adam Charles

TL;DR
This paper introduces MM-PHATE, a visualization method that reveals how RNN internal representations evolve during training across multiple dimensions, aiding understanding and development of better models.
Contribution
The paper presents MM-PHATE, a novel graph-based embedding technique for visualizing RNN hidden states across time, training epochs, and units, capturing dynamic changes during learning.
Findings
MM-PHATE preserves community structure in hidden states.
It reveals training-phase changes in representation geometry.
Aligns with probes and mutual information metrics.
Abstract
Recurrent neural networks (RNNs) are a widely used tool for sequential data analysis; however, they are still often seen as black boxes. Visualizing the internal dynamics of RNNs is a critical step toward understanding their functional principles and developing better architectures and optimization strategies. Prior studies typically emphasize network representations only after training, overlooking how those representations evolve during learning. Here, we present Multiway Multislice PHATE (MM-PHATE), a graph-based embedding method for visualizing the evolution of RNN hidden states across the multiple dimensions spanned by RNNs: time, training epoch, and units. Across controlled synthetic benchmarks and real RNN applications, MM-PHATE preserves hidden-representation community structure among units and reveals training-phase changes in representation geometry. In controlled synthetic…
Peer Reviews
Decision·Submitted to ICLR 2025
## Strength 1. The visualization results look cool. 2. The motivation is clear and the understanding of RNN is meaningful. 3. The analyses are comprehensive.
## Weakness 1. The codes and datasets are missing, limiting the reproducibility. 2. Recommend generalizing the proposed method to more recent sequential models like transformers. 3. The statistical information of the datasets are missing.
- *Interesting Problem.* Understanding the learning dynamics of RNN representations is an interesting problem. It will enable additional insights into improving their architectures. Though transformers are certainly more popular these days, I think that RNNs are still useful in some areas, so this is a useful task. - *Some Nice Insights.* The author's demonstration of the relationship between the entropy of the representation and loss/over-fitting and related experiments are interesting observa
- *Novelty.* This paper builds off of PHATE and MMPhate to introduce a visualization model specifically for RNNs, which accounts for the sequential nature of the data. This involves some changes to the tensor and kernel to include additional edges amongst the sequence steps, in addition to the epoch steps. I do not see this as significant novelty. - *Weak baselines/Limited Datasets.* The authors limit their analysis to one tasks (classification), on two datasets. The visualizations compared a
- This paper is clearly written, well organized. - Experiments can effectively reflect the intended objectives of the model.
- The technical contribution is incremental. The authors merely add the time step dimension to the M-PHATE and do not make any additional adaptation for RNN. - MM-PHATE is closely related to M-PHATE, but the experiment lacks analysis of M-PHATE.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Physics and Python Applications · Biomedical and Engineering Education
