A Study of Performance Programming of CPU, GPU accelerated Computers and   SIMD Architecture

Xinyao Yi

arXiv:2409.10661·cs.DC·September 18, 2024·2 cites

A Study of Performance Programming of CPU, GPU accelerated Computers and SIMD Architecture

Xinyao Yi

PDF

Open Access

TL;DR

This paper reviews the current state and future trends of parallel programming techniques for CPU, GPU, and SIMD architectures, highlighting performance challenges and solutions in high-performance computing.

Contribution

It provides a comprehensive summary of parallel programming methods, performance issues, and future directions in heterogeneous and SIMD computing architectures.

Findings

01

Parallel computing methods include multithreading, GPU/FPGA acceleration, and SIMD architectures.

02

Performance bottlenecks often stem from data transfer and hardware complexity.

03

The paper offers insights into optimizing parallel programs and future research directions.

Abstract

Parallel computing is a standard approach to achieving high-performance computing (HPC). Three commonly used methods to implement parallel computing include: 1) applying multithreading technology on single-core or multi-core CPUs; 2) incorporating powerful parallel computing devices such as GPUs, FPGAs, and other accelerators; and 3) utilizing special parallel architectures like Single Instruction/Multiple Data (SIMD). Many researchers have made efforts using different parallel technologies, including developing applications, conducting performance analyses, identifying performance bottlenecks, and proposing feasible solutions. However, balancing and optimizing parallel programs remain challenging due to the complexity of parallel algorithms and hardware architectures. Issues such as data transfer between hosts and devices in heterogeneous systems continue to be bottlenecks that limit…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInternet of Things and Social Network Interactions