Temporal Pyramid Transformer with Multimodal Interaction for Video Question Answering