Loading paper
Structured Two-stream Attention Network for Video Question Answering | Tomesphere