Exploring the Versal AI Engine for 3D Gaussian Splatting
Kotaro Shimamura, Ayumi Ohno, Shinya Takamaeda-Yamazaki

TL;DR
This paper evaluates the AMD Versal AI Engine's performance on 3D Gaussian splatting workloads, demonstrating a 226-fold throughput improvement through a novel algorithm that exploits the architecture's spatial and SIMD capabilities.
Contribution
It provides the first comprehensive performance analysis of the Versal AI Engine on 3D Gaussian splatting and introduces a dedicated algorithm to maximize hardware utilization.
Findings
226-fold throughput increase with the proposed method
Efficient processing of matrix multiplications and color computations
Effective pipelined task assignment exploiting spatial parallelism
Abstract
Dataflow-oriented spatial architectures are the emerging paradigm for higher computation performance and efficiency. AMD Versal AI Engine is a commercial spatial architecture consisting of tiles of VLIW processors supporting SIMD operations arranged in a two-dimensional mesh. The architecture requires the explicit design of task assignments and dataflow configurations for each tile to maximize performance, demanding advanced techniques and meticulous design. However, a few works revealed the performance characteristics of the Versal AI Engine through practical workloads. In this work, we provide the comprehensive performance evaluation of the Versal AI Engine using Gaussian feature computation in 3D Gaussian splatting as a practical workload, and we then propose a novel dedicated algorithm to fully exploit the hardware architecture. The computations of 3D Gaussian splatting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIndustrial Vision Systems and Defect Detection · Image Processing and 3D Reconstruction
