Loading paper
Attention-Based Keyword Localisation in Speech using Visual Grounding | Tomesphere