Loading paper
Learning to Recognise Words using Visually Grounded Speech | Tomesphere