Loading paper
Probing Visual-Audio Representation for Video Highlight Detection via Hard-Pairs Guided Contrastive Learning | Tomesphere