Bridging the Modality Gap: Softly Discretizing Audio Representation for LLM-based Automatic Speech Recognition