Loading paper
Modality Attention for End-to-End Audio-visual Speech Recognition | Tomesphere