Loading paper
Gaussian Kernelized Self-Attention for Long Sequence Data and Its Application to CTC-based Speech Recognition | Tomesphere