Loading paper
Streaming Audio-Visual Speech Recognition with Alignment Regularization | Tomesphere