Loading paper
VTQA: Visual Text Question Answering via Entity Alignment and Cross-Media Reasoning | Tomesphere