Loading paper
DiG: Differential Grounding for Enhancing Fine-Grained Perception in Multimodal Large Language Model | Tomesphere