Improving Joint Speech-Text Representations Without Alignment