Loading paper
SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information | Tomesphere