Loading paper
LiRA: Learning Visual Speech Representations from Audio through Self-supervision | Tomesphere