X-AVDT: Audio-Visual Cross-Attention for Robust Deepfake Detection