Loading paper
Towards Pixel-Level VLM Perception via Simple Points Prediction | Tomesphere