TPA-Net: Generate A Dataset for Text to Physics-based Animation

Yuxing Qiu; Feng Gao; Minchen Li; Govind Thattai; Yin Yang; Chenfanfu; Jiang

arXiv:2211.13887·cs.AI·November 28, 2022

TPA-Net: Generate A Dataset for Text to Physics-based Animation

Yuxing Qiu, Feng Gao, Minchen Li, Govind Thattai, Yin Yang, Chenfanfu, Jiang

PDF

Open Access

TL;DR

This paper introduces TPA-Net, a dataset with high-resolution 3D physical simulations and textual descriptions to improve the physical realism of text-to-video generation systems.

Contribution

It presents an autonomous data generation method and a comprehensive dataset using advanced physical simulation techniques for better physical realism in text-to-video tasks.

Findings

01

Provides high-quality multi-view videos for T2V and NeRF research.

02

Uses state-of-the-art physical simulation methods like IPC and MPM.

03

Lays groundwork for fully automated text-to-video/simulation systems.

Abstract

Recent breakthroughs in Vision-Language (V&L) joint research have achieved remarkable results in various text-driven tasks. High-quality Text-to-video (T2V), a task that has been long considered mission-impossible, was proven feasible with reasonably good results in latest works. However, the resulting videos often have undesired artifacts largely because the system is purely data-driven and agnostic to the physical laws. To tackle this issue and further push T2V towards high-level physical realism, we present an autonomous data generation technique and a dataset, which intend to narrow the gap with a large number of multi-modal, 3D Text-to-Video/Simulation (T2V/S) data. In the dataset, we provide high-resolution 3D physical simulations for both solids and fluids, along with textual descriptions of the physical phenomena. We take advantage of state-of-the-art physical simulation methods…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Processing and 3D Reconstruction · Machine Learning in Materials Science · Handwritten Text Recognition Techniques