Loading paper
Leveraging Vision-Language Models to Detect Attention in Educational Videos | Tomesphere