Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering