Loading paper
Read, Watch, and Move: Reinforcement Learning for Temporally Grounding Natural Language Descriptions in Videos | Tomesphere