Cascade-Free Mandarin Visual Speech Recognition via Semantic-Guided Cross-Representation Alignment