Multi-modal Factorized Bilinear Pooling with Co-Attention Learning for Visual Question Answering