Open-Vocabulary High-Resolution 3D (OVHR3D) Data Segmentation and   Annotation Framework

Jiuyi Xu; Meida Chen; Andrew Feng; Zifan Yu; Yangming Shi

arXiv:2412.06268·cs.CV·December 19, 2024

Open-Vocabulary High-Resolution 3D (OVHR3D) Data Segmentation and Annotation Framework

Jiuyi Xu, Meida Chen, Andrew Feng, Zifan Yu, Yangming Shi

PDF

Open Access

TL;DR

This paper introduces OVHR3D, a framework that combines advanced models and visualization tools to improve 3D data segmentation and annotation efficiency for military simulation environments.

Contribution

It presents a novel integrated framework utilizing Grounding DINO, Segment Anything Model, and enhanced 2D rendering for efficient 3D data annotation.

Findings

01

Enhanced annotation efficiency demonstrated

02

Framework effectively integrates multiple models

03

User-friendly interface facilitates annotation process

Abstract

In the domain of the U.S. Army modeling and simulation, the availability of high quality annotated 3D data is pivotal to creating virtual environments for training and simulations. Traditional methodologies for 3D semantic and instance segmentation, such as KpConv, RandLA, Mask3D, etc., are designed to train on extensive labeled datasets to obtain satisfactory performance in practical tasks. This requirement presents a significant challenge, given the inherent scarcity of manually annotated 3D datasets, particularly for the military use cases. Recognizing this gap, our previous research leverages the One World Terrain data repository manually annotated databases, as showcased at IITSEC 2019 and 2021, to enrich the training dataset for deep learning models. However, collecting and annotating large scale 3D data for specific tasks remains costly and inefficient. To this end, the objective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques

MethodsAttention Is All You Need · Linear Layer · Softmax · Dense Connections · Residual Connection · Multi-Head Attention · Layer Normalization · Vision Transformer · self-DIstillation with NO labels