Loading paper
Cross-Modal Bottleneck Fusion For Noise Robust Audio-Visual Speech Recognition | Tomesphere