Text-Only Training for Image Captioning using Noise-Injected CLIP