Loading paper
SMART: Shot-Aware Multimodal Video Moment Retrieval with Audio-Enhanced MLLM | Tomesphere