Low-Rank HOCA: Efficient High-Order Cross-Modal Attention for Video Captioning