Grid-LOGAT: Grid Based Local and Global Area Transcription for Video Question Answering