A Whisper transformer for audio captioning trained with synthetic captions and transfer learning