Loading paper
Training Audio Captioning Models without Audio | Tomesphere