A survey of Object Classification and Detection based on 2D/3D data
Xiaoke Shen

TL;DR
This survey reviews recent advances in 2D and 3D object classification and detection, highlighting challenges, data representations, and applications in autonomous systems, with a focus on deep learning methods.
Contribution
It provides a comprehensive overview of 2D and 3D object recognition techniques, analyzing their differences, challenges, and connections, and categorizes systems by application, data representation, and task.
Findings
3D methods improve location accuracy over 2D systems.
Challenges include data complexity, resource demands, and dataset scarcity.
Integration of 2D and 3D approaches enhances object recognition capabilities.
Abstract
Recently, by using deep neural network based algorithms, object classification, detection and semantic segmentation solutions are significantly improved. However, one challenge for 2D image-based systems is that they cannot provide accurate 3D location information. This is critical for location sensitive applications such as autonomous driving and robot navigation. On the other hand, 3D methods, such as RGB-D and RGB-LiDAR based systems, can provide solutions that significantly improve the RGB only approaches. That is why this is an interesting research area for both industry and academia. Compared with 2D image-based systems, 3D-based systems are more complicated due to the following five reasons: 1) Data representation itself is more complicated. 3D images can be represented by point clouds, meshes, volumes. 2D images have pixel grid representations. 2) The computation and memory…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Robotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques
