Loading paper
Cross-Modal Binary Attention: An Energy-Efficient Fusion Framework for Audio-Visual Learning | Tomesphere