Loading paper
Multi-modal Aggregation for Video Classification | Tomesphere