Multi-Source Transformer Architectures for Audiovisual Scene Classification