Loading paper
Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment | Tomesphere