Loading paper
Grounding Everything in Tokens for Multimodal Large Language Models | Tomesphere