ePose: Let's Make EfficientPose More Generally Applicable

Austin Lally; Robert Bain; Mazen Alotaibi

arXiv:2111.15114·cs.CV·December 1, 2021

ePose: Let's Make EfficientPose More Generally Applicable

Austin Lally, Robert Bain, Mazen Alotaibi

PDF

Open Access 1 Repo

TL;DR

This paper introduces ePose, an enhanced version of EfficientPose that can infer object sizes and simplifies data collection and loss calculations, aiming for broader applicability in 3D object detection.

Contribution

ePose extends EfficientPose by enabling size inference and streamlining data and loss processes for improved 3D detection.

Findings

01

Evaluated on Linemod and Occlusion 1-class datasets

02

Demonstrated improved efficiency and applicability

03

Discussed potential use with NuScenes and KITTI datasets

Abstract

EfficientPose is an impressive 3D object detection model. It has been demonstrated to be quick, scalable, and accurate, especially when considering that it uses only RGB inputs. In this paper we try to improve on EfficientPose by giving it the ability to infer an object's size, and by simplifying both the data collection and loss calculations. We evaluated ePose using the Linemod dataset and a new subset of it called "Occlusion 1-class". We also outline our current progress and thoughts about using ePose with the NuScenes and the 2017 KITTI 3D Object Detection datasets. The source code is available at https://github.com/tbd-clip/EfficientPose.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tbd-clip/efficientpose
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Human Pose and Action Recognition · Robotics and Sensor-Based Localization