Loading paper
Exploiting temporal information to detect conversational groups in videos and predict the next speaker | Tomesphere