Discrete Cross-Modal Alignment Enables Zero-Shot Speech Translation