Improving Semantic Image Segmentation via Label Fusion in Semantically   Textured Meshes

Florian Fervers; Timo Breuer; Gregor Stachowiak; Sebastian Bullinger,; Christoph Bodensteiner; Michael Arens

arXiv:2111.11103·cs.CV·February 25, 2022

Improving Semantic Image Segmentation via Label Fusion in Semantically Textured Meshes

Florian Fervers, Timo Breuer, Gregor Stachowiak, Sebastian Bullinger,, Christoph Bodensteiner, Michael Arens

PDF

1 Repo

TL;DR

This paper introduces an unsupervised label fusion framework using 3D meshes and semantic textures to enhance pixel-level semantic segmentation in videos, significantly improving accuracy over existing methods.

Contribution

It presents the first publicly available label fusion framework based on semantic meshes, leveraging CUDA for uncertainty-aware fusion of multi-frame predictions.

Findings

01

Improved pixel accuracy from 52.05% to 58.25% on ScanNet dataset.

02

Utilizes entire probability distribution of classes for better segmentation.

03

Provides open-source code to facilitate future research.

Abstract

Models for semantic segmentation require a large amount of hand-labeled training data which is costly and time-consuming to produce. For this purpose, we present a label fusion framework that is capable of improving semantic pixel labels of video sequences in an unsupervised manner. We make use of a 3D mesh representation of the environment and fuse the predictions of different frames into a consistent representation using semantic mesh textures. Rendering the semantic mesh using the original intrinsic and extrinsic camera parameters yields a set of improved semantic segmentation images. Due to our optimized CUDA implementation, we are able to exploit the entire $c$ -dimensional probability distribution of annotations over $c$ classes in an uncertainty-aware manner. We evaluate our method on the Scannet dataset where we improve annotations produced by the state-of-the-art segmentation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fferflo/semantic-meshes
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.