Towards Local Visual Modeling for Image Captioning