Loading paper
Multimodal Language Models for Domain-Specific Procedural Video Summarization | Tomesphere