Loading paper
SFA: Scan, Focus, and Amplify toward Guidance-aware Answering for Video TextVQA | Tomesphere