Loading paper
Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models | Tomesphere