ESP-PCT: Enhanced VR Semantic Performance through Efficient Compression   of Temporal and Spatial Redundancies in Point Cloud Transformers

Luoyu Mei; Shuai Wang; Yun Cheng; Ruofeng Liu; Zhimeng Yin; Wenchao; Jiang; Shuai Wang; Wei Gong

arXiv:2409.01216·cs.CV·September 4, 2024

ESP-PCT: Enhanced VR Semantic Performance through Efficient Compression of Temporal and Spatial Redundancies in Point Cloud Transformers

Luoyu Mei, Shuai Wang, Yun Cheng, Ruofeng Liu, Zhimeng Yin, Wenchao, Jiang, Shuai Wang, Wei Gong

PDF

1 Repo

TL;DR

ESP-PCT is a novel point cloud transformer designed for VR semantic recognition that significantly improves accuracy while reducing computational and memory requirements through efficient redundancy compression.

Contribution

Introduces ESP-PCT, a two-stage semantic recognition framework that enhances VR point cloud processing by jointly training localization and focus stages end-to-end.

Findings

01

Achieves 93.2% recognition accuracy.

02

Reduces FLOPs by 76.9%.

03

Lowers memory usage by 78.2%.

Abstract

Semantic recognition is pivotal in virtual reality (VR) applications, enabling immersive and interactive experiences. A promising approach is utilizing millimeter-wave (mmWave) signals to generate point clouds. However, the high computational and memory demands of current mmWave point cloud models hinder their efficiency and reliability. To address this limitation, our paper introduces ESP-PCT, a novel Enhanced Semantic Performance Point Cloud Transformer with a two-stage semantic recognition framework tailored for VR applications. ESP-PCT takes advantage of the accuracy of sensory point cloud data and optimizes the semantic recognition process, where the localization and focus stages are trained jointly in an end-to-end manner. We evaluate ESP-PCT on various VR semantic recognition conditions, demonstrating substantial enhancements in recognition efficiency. Notably, ESP-PCT achieves a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lymei-seu/esp-pct
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAttention Is All You Need · Byte Pair Encoding · Absolute Position Encodings · Softmax · Label Smoothing · Linear Layer · Adam · Dropout · Layer Normalization · Dense Connections