Tiling for Performance Tuning on Different Models of GPUs
Chang Xu (1), Steven R. Kirk (2), Samantha Jenkins (2) ((1) Department of Information Engineering, Zhejiang Business Technology Institute, Ningbo, China; (2) Department of Computer Science, Informatics, University West, Trollhattan, Sweden)

TL;DR
This paper investigates how different GPU models impact the effectiveness of tiling strategies in CUDA programs, using image interpolation as a test case, highlighting the need for model-specific optimization.
Contribution
It analyzes the influence of various GPU models on tiling performance, emphasizing that optimization strategies must be tailored to specific hardware for best results.
Findings
Tiling strategies vary in effectiveness across GPU models.
Optimizations on one GPU model may not transfer well to others.
External conditions influence tiling performance significantly.
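
The point that a tile size tuned on one GPU model may not carry over to another can be illustrated by choosing the tile edge from each device's reported limits instead of hard-coding it. The sketch below is only an illustration under stated assumptions: `pick_tile_edge` and its power-of-two rule are hypothetical and are not the paper's selection method. It only uses fields that `cudaGetDeviceProperties` reports.

```cuda
#include <cstddef>
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical heuristic (not from the paper): return the largest
// power-of-two tile edge such that an edge x edge thread block fits the
// per-block thread limit and an (edge+1) x (edge+1) shared-memory tile,
// i.e. one halo row and column, fits the per-block shared memory.
int pick_tile_edge(int max_threads_per_block,
                   size_t shared_bytes_per_block,
                   size_t bytes_per_element)
{
    int edge = 1;
    while (true) {
        const int next = edge * 2;
        const size_t threads = static_cast<size_t>(next) * next;
        const size_t smem =
            static_cast<size_t>(next + 1) * (next + 1) * bytes_per_element;
        if (threads > static_cast<size_t>(max_threads_per_block) ||
            smem > shared_bytes_per_block) {
            break;
        }
        edge = next;
    }
    return edge;
}

int main()
{
    int count = 0;
    if (cudaGetDeviceCount(&count) != cudaSuccess || count == 0) {
        std::printf("No CUDA device found\n");
        return 0;
    }
    for (int d = 0; d < count; ++d) {
        cudaDeviceProp p;
        cudaGetDeviceProperties(&p, d);
        const int edge = pick_tile_edge(p.maxThreadsPerBlock,
                                        p.sharedMemPerBlock, sizeof(float));
        std::printf("%s (compute capability %d.%d): tile edge %d\n",
                    p.name, p.major, p.minor, edge);
    }
    return 0;
}
```

A limit-based choice like this is only a starting point. The findings above suggest that the best tile size still has to be confirmed by timing the kernel on each target model.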
Abstract
The strategy of using CUDA-compatible GPUs as a parallel computation solution to improve program performance has gained increasingly wide acceptance during the two years since the CUDA platform was released. Its benefit extends from the graphics domain to many other computationally intensive domains. Tiling, as the most general and important technique, is widely used for optimization in CUDA programs. However, new models of GPUs with better compute capabilities have been released, along with new versions of the CUDA SDK. These updated compute capabilities must be considered when optimizing using the tiling technique. In this paper, we implement image interpolation algorithms as a test case to discuss how different tiling strategies affect the program's performance. We especially focus on how the different models of GPUs affect the tiling's effectiveness by executing…
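
To make the tiling idea in the abstract concrete, here is a minimal sketch of a shared-memory tiled kernel for bilinear image upscaling. It is not the authors' implementation. The kernel name, the fixed `TILE` of 16, the integer scale factor, the single-channel float image layout, and the test pattern are all assumptions chosen for illustration. Each block first copies the source pixels it needs into shared memory, then every thread interpolates one output pixel from that tile.

```cuda
#include <algorithm>
#include <cmath>
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

constexpr int TILE = 16;  // assumed tile edge; the tunable parameter

// Upscale a w x h float image by an integer factor `scale` (assumption).
__global__ void upscale_bilinear_tiled(const float* src, int w, int h,
                                       float* dst, int scale)
{
    // For scale >= 1 a block touches at most (TILE+1)^2 source pixels.
    __shared__ float tile[TILE + 1][TILE + 1];
    const int W = w * scale, H = h * scale;
    const int ox = (blockIdx.x * TILE) / scale;  // tile origin in source
    const int oy = (blockIdx.y * TILE) / scale;

    // Cooperative load, clamping reads at the image border.
    for (int i = threadIdx.y * TILE + threadIdx.x;
         i < (TILE + 1) * (TILE + 1); i += TILE * TILE) {
        const int ty = i / (TILE + 1), tx = i % (TILE + 1);
        tile[ty][tx] = src[min(oy + ty, h - 1) * w + min(ox + tx, w - 1)];
    }
    __syncthreads();

    const int x = blockIdx.x * TILE + threadIdx.x;
    const int y = blockIdx.y * TILE + threadIdx.y;
    if (x >= W || y >= H) return;

    const int lx = x / scale - ox, ly = y / scale - oy;  // tile-local index
    const float fx = float(x % scale) / scale;
    const float fy = float(y % scale) / scale;
    const float top = tile[ly][lx] * (1 - fx) + tile[ly][lx + 1] * fx;
    const float bot = tile[ly + 1][lx] * (1 - fx) + tile[ly + 1][lx + 1] * fx;
    dst[y * W + x] = top * (1 - fy) + bot * fy;
}

// Run the kernel on a synthetic image and compare with a CPU reference.
float max_abs_diff_vs_cpu(int w, int h, int scale)
{
    const int W = w * scale, H = h * scale;
    std::vector<float> src(w * h), ref(W * H), out(W * H);
    for (int i = 0; i < w * h; ++i) src[i] = float((i * 37) % 101) / 100.0f;

    for (int y = 0; y < H; ++y)
        for (int x = 0; x < W; ++x) {
            const int x0 = x / scale, y0 = y / scale;
            const int x1 = std::min(x0 + 1, w - 1), y1 = std::min(y0 + 1, h - 1);
            const float fx = float(x % scale) / scale;
            const float fy = float(y % scale) / scale;
            const float top = src[y0 * w + x0] * (1 - fx) + src[y0 * w + x1] * fx;
            const float bot = src[y1 * w + x0] * (1 - fx) + src[y1 * w + x1] * fx;
            ref[y * W + x] = top * (1 - fy) + bot * fy;
        }

    float *d_src = nullptr, *d_dst = nullptr;
    cudaMalloc(&d_src, src.size() * sizeof(float));
    cudaMalloc(&d_dst, out.size() * sizeof(float));
    cudaMemcpy(d_src, src.data(), src.size() * sizeof(float),
               cudaMemcpyHostToDevice);
    const dim3 block(TILE, TILE);
    const dim3 grid((W + TILE - 1) / TILE, (H + TILE - 1) / TILE);
    upscale_bilinear_tiled<<<grid, block>>>(d_src, w, h, d_dst, scale);
    cudaMemcpy(out.data(), d_dst, out.size() * sizeof(float),
               cudaMemcpyDeviceToHost);
    cudaFree(d_src);
    cudaFree(d_dst);

    float worst = 0.0f;
    for (int i = 0; i < W * H; ++i)
        worst = std::max(worst, std::fabs(out[i] - ref[i]));
    return worst;
}

int main()
{
    std::printf("max |gpu - cpu| = %g\n", max_abs_diff_vs_cpu(50, 30, 3));
    return 0;
}
```

Changing `TILE` alters both the threads per block and the shared memory used per block, which is why the same tiling choice can behave differently on GPUs with different compute capabilities.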
Taxonomy
Topics: Parallel Computing and Optimization Techniques · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
