Loading paper
Top-Down Compression: Revisit Efficient Vision Token Projection for Visual Instruction Tuning | Tomesphere