Loading paper
Teaching Vision-Language Models to Ask: Resolving Ambiguity in Visual Questions | Tomesphere