Loading paper
Text-to-Audio Grounding Based Novel Metric for Evaluating Audio Caption Similarity | Tomesphere