A Feasibility Study on Porting the Community Land Model onto Accelerators Using Openacc
Journal Title: International Journal of Advanced Computer Science & Applications - Year 2014, Vol 5, Issue 12
Abstract
As environmental models (such as Accelerated Climate Model for Energy (ACME), Parallel Reactive Flow and Transport Model (PFLOTRAN), Arctic Terrestrial Simulator (ATS), etc.) became more and more complicated, we are facing enormous challenges regarding to porting those applications onto hybrid computing architecture. OpenACC emerges as a very promising technology, therefore, we have conducted a feasibility analysis on porting the Community Land Model (CLM), a terrestrial ecosystem model within the Community Earth System Models (CESM)). Specifically, we used automatic function testing platform to extract a small computing kernel out of CLM, then we apply this kernel into the actually CLM dataflow procedure, and investigate the strategy of data parallelization and the benefit of data movement provided by current implementation of OpenACC. Even it is a non-intensive kernel, on a single 16-core computing node, the performance (based on the actual computation time using one GPU) of OpenACC implementation is 2.3 time faster than that of OpenMP implementation using single OpenMP thread, but it is 2.8 times slower than the performance of OpenMP implementation using 16 threads. On multiple nodes, MPI_OpenACC implementation demonstrated very good scalability on up to 128 GPUs on 128 computing nodes. This study also provides useful information for us to look into the potential benefits of “deep copy” capability and “routine” feature of OpenACC standards. We believe that our experience on the environmental model, CLM, can be beneficial to many other scientific research programs who are interested to porting their large scale scientific code using OpenACC onto high-end computers, empowered by hybrid computing architecture.
Authors and Affiliations
D. Wang, W. Wu, F. Winkler, O. Hernandez
Designing Novel Queries for Analysing NoSQL Data of Gene-Disease Associations
To precisely identify gene associated diseases has been an open area of research for biological scientists to ensure clinical and psychological symptoms and treatment for human diseases. Because whole Human Genome is def...
An Efficent Lossless Compression Scheme for ECG Signal
Cardiac diseases constitute the main cause of mortality around the globe. For detection and identification of cardiac problems, it is very important to monitor the patient's heart activities for long periods during his n...
Comparative Analysis of Network Libraries for Offloading Efficiency in Mobile Cloud Environment
In the modern era, smartphones are increasingly becoming an integral and essential part of our daily life. Although the hardware capabilities of the smartphones (i.e., processing, memory, battery, and communication) are...
Quantitative Analysis of Healthy and Pathological Vocal Fold Vibrations using an Optical Flow based Waveform
The objective assessment of the vocal fold vibrations is important in diagnosing several vocal diseases. Given the high speed of the vibrations, the high speed videoendoscopy is commonly used to capture the vocal fold mo...
Brain-Controlled for Changing Modular Robot Configuration by Employing Neurosky’s Headset
Currently, the Brain Computer Interfaces (BCI) system was designed mostly to be implemented for control purpose or navigation which are mostly being employed for mobile robot, manipulator robot and humanoid robot by usin...