Loading paper
Towards Good Practices for Multi-modal Fusion in Large-scale Video Classification | Tomesphere