Performance Evaluation of CPU-GPU communication Depending on the Characteristic of Co-Located Workloads
Journal Title: International Journal on Computer Science and Engineering - Year 2013, Vol 5, Issue 5
Abstract
Many recent studies exploit the high computational performance of GPUs for complex computation and big data processing. The Tesla K20X, recently announced by NVIDIA, provides 3.95 TFLOPS of peak single-precision floating-point performance [1], which is 10 times higher than that of Intel’s high-end CPUs. Owing to this performance, the K20X was adopted in Titan, the top-ranked supercomputer in the world [2][3]. However, GPU computing requires additional steps that CPU-only computation does not. The input data must be moved from main memory to the global memory of the GPU before the GPU computation starts, and the results produced on the GPU must be written back to main memory. This data movement is called CPU-GPU communication. Because it makes up a large part of any computation that uses a GPU, many studies have tried to optimize it [4][5]. In this paper, we evaluate the performance of CPU-GPU communication under different co-located workloads and show which workload degrades it most severely.
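The two communication steps described in the abstract can be illustrated with a minimal CUDA sketch. It is not the benchmark used in the paper: the kernel `scale`, the helper `round_trip_ok`, and the 1 Mi-element buffer size are hypothetical names and values chosen only for illustration. The sketch copies an array from main memory to GPU global memory, runs a trivial kernel, copies the result back, and times each phase with CUDA events.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical kernel: doubles each element held in GPU global memory.
__global__ void scale(float *data, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= 2.0f;
}

// Hypothetical helper: performs one host-to-device copy, one kernel launch,
// and one device-to-host copy, printing the time of each phase.
// Returns true if every step succeeded and the results are correct.
bool round_trip_ok(int n) {
    size_t bytes = (size_t)n * sizeof(float);
    float *host = (float *)malloc(bytes);
    if (!host) return false;
    for (int i = 0; i < n; ++i) host[i] = (float)i;

    float *dev = nullptr;
    if (cudaMalloc(&dev, bytes) != cudaSuccess) { free(host); return false; }

    cudaEvent_t t0, t1, t2, t3;
    cudaEventCreate(&t0); cudaEventCreate(&t1);
    cudaEventCreate(&t2); cudaEventCreate(&t3);

    cudaEventRecord(t0);
    // Step 1: main memory -> GPU global memory.
    cudaError_t e1 = cudaMemcpy(dev, host, bytes, cudaMemcpyHostToDevice);
    cudaEventRecord(t1);
    scale<<<(n + 255) / 256, 256>>>(dev, n);
    cudaEventRecord(t2);
    // Step 2: GPU global memory -> main memory (write-back of results).
    cudaError_t e2 = cudaMemcpy(host, dev, bytes, cudaMemcpyDeviceToHost);
    cudaEventRecord(t3);
    cudaEventSynchronize(t3);

    bool ok = (e1 == cudaSuccess) && (e2 == cudaSuccess) &&
              (cudaGetLastError() == cudaSuccess);
    if (ok) {
        float h2d = 0, kern = 0, d2h = 0;
        cudaEventElapsedTime(&h2d, t0, t1);
        cudaEventElapsedTime(&kern, t1, t2);
        cudaEventElapsedTime(&d2h, t2, t3);
        printf("host->device: %.3f ms, kernel: %.3f ms, device->host: %.3f ms\n",
               h2d, kern, d2h);
        // Verify that the data made the round trip and was doubled.
        for (int i = 0; i < n && ok; ++i) ok = (host[i] == 2.0f * (float)i);
    }

    cudaEventDestroy(t0); cudaEventDestroy(t1);
    cudaEventDestroy(t2); cudaEventDestroy(t3);
    cudaFree(dev);
    free(host);
    return ok;
}

int main() {
    // 1 Mi floats (4 MiB); an arbitrary size chosen for illustration.
    return round_trip_ok(1 << 20) ? 0 : 1;
}
```

Compiled with `nvcc` and run while other processes load the CPU, memory, or I/O subsystem, a program of this kind would let one observe how the two copy times change with the co-located workload. The actual timings depend on the hardware and are not reproduced here.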
Authors and Affiliations
Dongyou Seo, Shin-gyu Kim, Hyeonsang Eom, Heon Y. Yeom
Estimation of worst case latency of periodic tasks in a real time distributed environment
Real-time computations require exact bounded response times. For relatively simple models of computation, it is possible to determine conditions under which it is theoretically possible to guarantee that an invocation of...
A Study on the Effectiveness of Different Patch Size and Shape for Eyes and Mouth Detection
Template matching is one of the simplest methods used for eyes and mouth detection. However, it can be modified and extended to become a powerful tool. Since the patch itself plays a significant role in optimizing detect...
K-Partition Model for Mining Frequent Patterns in Large Databases
Mining frequent patterns has always been a great field of research for investigators. Various algorithms were developed for finding out frequent patterns in an efficient manner. But the major drawback of all these resear...
Medical Image Matching and Retrieval using Discrete Sine Transform
Visual information has been extensively used in the areas of multimedia, medical imaging and other numerous applications. Management of these visual information is challenging as the quantity of data available is very hu...
NATURAL LANGUAGE QUERY PROCESSING USING SEMANTIC GRAMMAR
Abstract: The field of natural language processing (NLP) has seen a dramatic shift in both research direction and methodology in the past several years. In the past, most work in computational linguistics tended to focus...