ChatRex: Taming Multimodal LLM for Joint Perception and Understanding