IVS3D: An Open Source Framework for Intelligent Video Sampling and   Preprocessing to Facilitate 3D Reconstruction

Max Hermann; Thomas Pollok; Daniel Brommer; Dominic Zahn

arXiv:2110.11810·cs.CV·October 25, 2021

IVS3D: An Open Source Framework for Intelligent Video Sampling and Preprocessing to Facilitate 3D Reconstruction

Max Hermann, Thomas Pollok, Daniel Brommer, Dominic Zahn

PDF

Open Access 1 Repo

TL;DR

IVS3D is an open source framework that intelligently preprocesses video data by sampling, resizing, and filtering images to facilitate efficient and accurate 3D reconstruction, with modular plugins for enhanced functionality.

Contribution

The paper introduces IVS3D, a modular open source framework with novel plugins for intelligent image selection, motion blur detection, and semantic filtering to improve 3D reconstruction workflows.

Findings

01

Reduces 3D reconstruction time by excluding redundant images.

02

Improves image quality by filtering motion-blurred frames.

03

Enables customizable preprocessing through plugin architecture.

Abstract

The creation of detailed 3D models is relevant for a wide range of applications such as navigation in three-dimensional space, construction planning or disaster assessment. However, the complex processing and long execution time for detailed 3D reconstructions require the original database to be reduced in order to obtain a result in reasonable time. In this paper we therefore present our framework iVS3D for intelligent pre-processing of image sequences. Our software is able to down sample entire videos to a specific frame rate, as well as to resize and crop the individual images. Furthermore, thanks to our modular architecture, it is easy to develop and integrate plugins with additional algorithms. We provide three plugins as baseline methods that enable an intelligent selection of suitable images and can enrich them with additional information. To filter out images affected by motion…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ivs3d/ivs3d
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Advanced Image and Video Retrieval Techniques · Advanced Image Processing Techniques

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings