Leveraging Pre-trained BERT for Audio Captioning