Loading paper
Scaffolding Coordinates to Promote Vision-Language Coordination in Large Multi-Modal Models | Tomesphere