Estimating Object Physical Properties from RGB-D Vision and Depth Robot Sensors Using Deep Learning

Ricardo Cardoso; Plinio Moreno

arXiv:2507.05029·cs.CV·December 15, 2025

Estimating Object Physical Properties from RGB-D Vision and Depth Robot Sensors Using Deep Learning

Ricardo Cardoso, Plinio Moreno

PDF

Open Access 3 Repos

TL;DR

This paper introduces a deep learning method that combines RGB and depth data from sensors to accurately estimate object mass, using synthetic datasets for training and outperforming existing benchmarks.

Contribution

It presents a novel approach integrating RGB-D data and synthetic dataset generation for improved object mass estimation in robotics.

Findings

01

Significantly outperforms existing benchmarks in mass estimation accuracy.

02

Uses synthetic RGB-D data to train models effectively.

03

Combines point-cloud and RGB data for enhanced estimation.

Abstract

Inertial mass plays a crucial role in robotic applications such as object grasping, manipulation, and simulation, providing a strong prior for planning and control. Accurately estimating an object's mass before interaction can significantly enhance the performance of various robotic tasks. However, mass estimation using only vision sensors is a relatively underexplored area. This paper proposes a novel approach combining sparse point-cloud data from depth images with RGB images to estimate the mass of objects. We evaluate a range of point-cloud processing architectures, alongside RGB-only methods. To overcome the limited availability of training data, we create a synthetic dataset using ShapeNetSem 3D models, simulating RGBD images via a Kinect camera. This synthetic data is used to train an image generation model for estimating dense depth maps, which we then use to augment an existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIndustrial Vision Systems and Defect Detection