Video captioning with recurrent networks based on frame- and video-level features and visual content classification