VideoScaffold: Elastic-Scale Visual Hierarchies for Streaming Video Understanding in MLLMs