Loading paper
Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models | Tomesphere