Efficient GPT Model Pre-training using Tensor Train Matrix Representation