Loading paper
SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer | Tomesphere