R2D2: Recursive Transformer based on Differentiable Tree for Interpretable Hierarchical Language Modeling