Loading paper
token2vec: A Joint Self-Supervised Pre-training Framework Using Unpaired Speech and Text | Tomesphere