Loading paper
MLP Architectures for Vision-and-Language Modeling: An Empirical Study | Tomesphere