Loading paper
HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models | Tomesphere