Loading paper
VFaith: Do Large Multimodal Models Really Reason on Seen Images Rather than Previous Memories? | Tomesphere