Fine-Grained Pillar Feature Encoding Via Spatio-Temporal Virtual Grid   for 3D Object Detection

Konyul Park; Yecheol Kim; Junho Koh; Byungwoo Park; Jun Won Choi

arXiv:2403.06433·cs.CV·March 12, 2024·1 cites

Fine-Grained Pillar Feature Encoding Via Spatio-Temporal Virtual Grid for 3D Object Detection

Konyul Park, Yecheol Kim, Junho Koh, Byungwoo Park, Jun Won Choi

PDF

Open Access 1 Repo

TL;DR

This paper introduces FG-PFE, a novel pillar encoding method using spatio-temporal virtual grids to better capture LiDAR point distributions, significantly improving 3D object detection accuracy with minimal computational cost.

Contribution

The paper proposes a new fine-grained pillar feature encoding architecture utilizing spatio-temporal grids, enhancing point distribution capture within pillars for improved detection performance.

Findings

01

FG-PFE outperforms baseline models on nuScenes dataset.

02

Significant accuracy improvements with minimal additional computation.

03

Effective encoding of point distributions across vertical, temporal, and horizontal dimensions.

Abstract

Developing high-performance, real-time architectures for LiDAR-based 3D object detectors is essential for the successful commercialization of autonomous vehicles. Pillar-based methods stand out as a practical choice for onboard deployment due to their computational efficiency. However, despite their efficiency, these methods can sometimes underperform compared to alternative point encoding techniques such as Voxel-encoding or PointNet++. We argue that current pillar-based methods have not sufficiently captured the fine-grained distributions of LiDAR points within each pillar structure. Consequently, there exists considerable room for improvement in pillar feature encoding. In this paper, we introduce a novel pillar encoding architecture referred to as Fine-Grained Pillar Feature Encoding (FG-PFE). FG-PFE utilizes Spatio-Temporal Virtual (STV) grids to capture the distribution of point…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

konyul/FG-PFE
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Neural Network Applications · Industrial Vision Systems and Defect Detection