# X-Section: Cross-Section Prediction for Enhanced RGBD Fusion

**Authors:** Andrea Nicastro, Ronald Clark, Stefan Leutenegger

arXiv: 1903.00987 · 2019-08-13

## TL;DR

X-Section introduces a deep learning-based RGB-D reconstruction method that predicts object thicknesses to improve 3D scene completion and integration into volumetric fusion, aiding robotics and AR/VR applications.

## Contribution

It presents a novel approach that predicts object thicknesses for enhanced 3D reconstruction, extending KinectFusion with deep learning for better scene understanding.

## Key findings

- Accurately predicts object thicknesses in indoor scenes
- Reconstructs complex scenes with multiple objects
- Improves 3D scene completion accuracy

## Abstract

Detailed 3D reconstruction is an important challenge with application to robotics, augmented and virtual reality, which has seen impressive progress throughout the past years. Advancements were driven by the availability of depth cameras (RGB-D), as well as increased compute power, e.g.\ in the form of GPUs -- but also thanks to inclusion of machine learning in the process. Here, we propose X-Section, an RGB-D 3D reconstruction approach that leverages deep learning to make object-level predictions about thicknesses that can be readily integrated into a volumetric multi-view fusion process, where we propose an extension to the popular KinectFusion approach. In essence, our method allows to complete shape in general indoor scenes behind what is sensed by the RGB-D camera, which may be crucial e.g.\ for robotic manipulation tasks or efficient scene exploration. Predicting object thicknesses rather than volumes allows us to work with comparably high spatial resolution without exploding memory and training data requirements on the employed Convolutional Neural Networks. In a series of qualitative and quantitative evaluations, we demonstrate how we accurately predict object thickness and reconstruct general 3D scenes containing multiple objects.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.00987/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/1903.00987/full.md

## References

46 references — full list in the complete paper: https://tomesphere.com/paper/1903.00987/full.md

---
Source: https://tomesphere.com/paper/1903.00987