Loading paper
Pre-Training Multi-Modal Dense Retrievers for Outside-Knowledge Visual Question Answering | Tomesphere