Comparison of OpenMP & OpenCL Parallel Processing Technologies
Krishnahari Thouti, S. R. Sathe

TL;DR
This paper compares OpenMP and OpenCL for parallel processing, analyzing their performance on various algorithms, and finds that OpenCL can effectively utilize multiple cores for improved computational efficiency.
Contribution
It provides a performance comparison of OpenMP and OpenCL, highlighting OpenCL's effectiveness in mapping threads across different processing cores.
Findings
OpenCL effectively maps threads on multiple cores.
Proper load balancing improves performance.
OpenCL shows competitive performance in benchmark tests.
Abstract
This paper presents a comparison of OpenMP and OpenCL based on the parallel implementation of algorithms from various fields of computer applications. The focus of our study is on the performance of benchmark comparing OpenMP and OpenCL. We observed that OpenCL programming model is a good option for mapping threads on different processing cores. Balancing all available cores and allocating sufficient amount of work among all computing units, can lead to improved performance. In our simulation, we used Fedora operating system; a system with Intel Xeon Dual core processor having thread count 24 coupled with NVIDIA Quadro FX 3800 as graphical processing unit.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems
