On the Evaluation and Refinement of Vision-Language Instruction Tuning Datasets