Loading paper
Fine-tuning Pre-trained Vision-Language Models in a Human-Annotation-Free Manner | Tomesphere