Bridging High-Quality Audio and Video via Language for Sound Effects Retrieval from Visual Queries