Consensus-based Sequence Training for Video Captioning