Accelerating Compact Fractals with Tensor Core GPUs
Felipe A. Quezada, Crist\'obal A. Navarro

TL;DR
This paper introduces a GPU-based method using tensor cores to efficiently process compact representations of fractals, significantly speeding up computations and reducing memory usage for large-scale fractal problems.
Contribution
The work develops a novel tensor core GPU thread mapping technique for compact fractal representations, enabling faster and more memory-efficient fractal computations.
Findings
Up to 11x speedup on A100 GPU
234x reduction in memory usage
Applicable to NBB class fractals and extendable to 3D
Abstract
This work presents a GPU thread mapping approach that allows doing fast parallel stencil-like computations on discrete fractals using their compact representation. The intuition behind is to employ two GPU tensor-core accelerated thread maps, and , which act as threadspace-to-dataspace and dataspace-to-threadspace functions, respectively. By combining these maps, threads can access compact space and interact with their neighbors. The cost of the maps is time, with being the side of a embedding for the fractal in its expanded form. The technique works on any fractal that belongs to the Non-overlapping-Bounding-Boxes (NBB) class of discrete fractals, and can be extended to three dimensions as well. Results using an A100 GPU on the Sierpinski Triangle as a case study show up to of speedup and a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCellular Automata and Applications · Algorithms and Data Compression · Advanced Data Storage Technologies
