Improving speaker turn embedding by crossmodal transfer learning from face embedding