Multi-attention Networks for Temporal Localization of Video-level Labels