Loading paper
LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs | Tomesphere