Loading paper
Unsupervised active speaker detection in media content using cross-modal information | Tomesphere