Loading paper
This&That: Language-Gesture Controlled Video Generation for Robot Planning | Tomesphere