Loading paper
TextOCVP: Object-Centric Video Prediction with Language Guidance | Tomesphere