LiveScene: Language Embedding Interactive Radiance Fields for Physical   Scene Rendering and Control

Delin Qu; Qizhi Chen; Pingrui Zhang; Xianqiang Gao; Junzhe Li; Bin; Zhao; Dong Wang; Xuelong Li

arXiv:2406.16038·cs.CV·March 13, 2025

LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and Control

Delin Qu, Qizhi Chen, Pingrui Zhang, Xianqiang Gao, Junzhe Li, Bin, Zhao, Dong Wang, Xuelong Li

PDF

Open Access 1 Datasets

TL;DR

LiveScene introduces a novel scene-level radiance field model that uses language embeddings for interactive scene reconstruction and control, enabling efficient, accurate, and natural language-guided manipulation of complex scenes.

Contribution

The paper presents LiveScene, a new method that decomposes scenes into local deformable fields and uses language embeddings for interactive control, advancing scene reconstruction and manipulation.

Findings

01

Outperforms existing methods in novel view synthesis

02

Enables natural language-based scene control

03

Reduces memory consumption in scene reconstruction

Abstract

This paper scales object-level reconstruction to complex scenes, advancing interactive scene reconstruction. We introduce two datasets, OmniSim and InterReal, featuring 28 scenes with multiple interactive objects. To tackle the challenge of inaccurate interactive motion recovery in complex scenes, we propose LiveScene, a scene-level language-embedded interactive radiance field that efficiently reconstructs and controls multiple objects. By decomposing the interactive scene into local deformable fields, LiveScene enables separate reconstruction of individual object motions, reducing memory consumption. Additionally, our interaction-aware language embedding localizes individual interactive objects, allowing for arbitrary control using natural language. Our approach demonstrates significant superiority in novel view synthesis, interactive scene control, and language grounding performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

IPEC-COMMUNITY/LiveScene
dataset· 203 dl
203 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputer Graphics and Visualization Techniques · Advanced Vision and Imaging · Human Motion and Animation