LRS-VoxMM: A benchmark for in-the-wild audio-visual speech recognition