Loading paper
SGCap: Decoding Semantic Group for Zero-shot Video Captioning | Tomesphere