BoxSplitGen: A Generative Model for 3D Part Bounding Boxes in Varying Granularity

Juil Koo; Wei-Tung Lin; Chanho Park; Chanhyeok Park; Minhyuk Sung

arXiv:2602.20666·cs.CV·February 25, 2026

BoxSplitGen: A Generative Model for 3D Part Bounding Boxes in Varying Granularity

Juil Koo, Wei-Tung Lin, Chanho Park, Chanhyeok Park, Minhyuk Sung

PDF

Open Access

TL;DR

This paper introduces BoxSplitGen, a novel generative framework for creating 3D shapes by iteratively splitting bounding boxes to refine shape details, aiding human-like 3D design processes.

Contribution

We propose a new generative model that produces 3D part bounding boxes with varying granularity and integrates with 3D diffusion models for shape generation.

Findings

01

BoxSplitGen outperforms token prediction and inpainting models.

02

The box-to-shape model achieves superior results with state-of-the-art diffusion.

03

Framework enables intuitive 3D shape refinement from coarse to fine.

Abstract

Human creativity follows a perceptual process, moving from abstract ideas to finer details during creation. While 3D generative models have advanced dramatically, models specifically designed to assist human imagination in 3D creation -- particularly for detailing abstractions from coarse to fine -- have not been explored. We propose a framework that enables intuitive and interactive 3D shape generation by iteratively splitting bounding boxes to refine the set of bounding boxes. The main technical components of our framework are two generative models: the box-splitting generative model and the box-to-shape generative model. The first model, named BoxSplitGen, generates a collection of 3D part bounding boxes with varying granularity by iteratively splitting coarse bounding boxes. It utilizes part bounding boxes created through agglomerative merging and learns the reverse of the merging…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Music Technology and Sound Studies · Generative Adversarial Networks and Image Synthesis