Loading paper
FlipVQA: Scaling Multi-modal Instruction Tuning via Textbook-to-Knowledge Synthesis | Tomesphere