A Study of Performance Programming of CPU, GPU accelerated Computers and SIMD Architecture
Xinyao Yi

TL;DR
This paper reviews the current state and future trends of parallel programming techniques for CPU, GPU, and SIMD architectures, highlighting performance challenges and solutions in high-performance computing.
Contribution
It provides a comprehensive summary of parallel programming methods, performance issues, and future directions in heterogeneous and SIMD computing architectures.
Findings
Parallel computing methods include multithreading, GPU/FPGA acceleration, and SIMD architectures.
Performance bottlenecks often stem from data transfer and hardware complexity.
The paper offers insights into optimizing parallel programs and future research directions.
Abstract
Parallel computing is a standard approach to achieving high-performance computing (HPC). Three commonly used methods to implement parallel computing include: 1) applying multithreading technology on single-core or multi-core CPUs; 2) incorporating powerful parallel computing devices such as GPUs, FPGAs, and other accelerators; and 3) utilizing special parallel architectures like Single Instruction/Multiple Data (SIMD). Many researchers have made efforts using different parallel technologies, including developing applications, conducting performance analyses, identifying performance bottlenecks, and proposing feasible solutions. However, balancing and optimizing parallel programs remain challenging due to the complexity of parallel algorithms and hardware architectures. Issues such as data transfer between hosts and devices in heterogeneous systems continue to be bottlenecks that limit…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInternet of Things and Social Network Interactions
