Loading paper
Spatio-Temporal Crop Aggregation for Video Representation Learning | Tomesphere