SOccDPT: Semi-Supervised 3D Semantic Occupancy from Dense Prediction   Transformers trained under memory constraints

Aditya Nalgunda Ganesh

arXiv:2311.11371·cs.CV·January 7, 2025·1 cites

SOccDPT: Semi-Supervised 3D Semantic Occupancy from Dense Prediction Transformers trained under memory constraints

Aditya Nalgunda Ganesh

PDF

Open Access

TL;DR

SOccDPT is a memory-efficient semi-supervised transformer-based model that predicts 3D semantic occupancy from monocular images, trained on unstructured datasets, and outperforms existing methods in unstructured traffic scenarios.

Contribution

We introduce SOccDPT, a novel semi-supervised dense prediction transformer model trained on unstructured datasets with patch-wise training for memory efficiency.

Findings

01

RMSE score of 9.1473 for disparity estimation

02

Semantic segmentation IoU of 46.02%

03

Operates at 69.47 Hz

Abstract

We present SOccDPT, a memory-efficient approach for 3D semantic occupancy prediction from monocular image input using dense prediction transformers. To address the limitations of existing methods trained on structured traffic datasets, we train our model on unstructured datasets including the Indian Driving Dataset and Bengaluru Driving Dataset. Our semi-supervised training pipeline allows SOccDPT to learn from datasets with limited labels by reducing the requirement for manual labelling by substituting it with pseudo-ground truth labels to produce our Bengaluru Semantic Occupancy Dataset. This broader training enhances our model's ability to handle unstructured traffic scenarios effectively. To overcome memory limitations during training, we introduce patch-wise training where we select a subset of parameters to train each epoch, reducing memory usage during auto-grad graph…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTraffic Prediction and Management Techniques · Advanced Neural Network Applications · Infrastructure Maintenance and Monitoring