Loading paper
ReMoRa: Multimodal Large Language Model based on Refined Motion Representation for Long-Video Understanding | Tomesphere