Loading paper
C3L: Content Correlated Vision-Language Instruction Tuning Data Generation via Contrastive Learning | Tomesphere