A Feasibility Study on Porting the Community Land Model onto Accelerators Using Openacc
Journal Title: International Journal of Advanced Computer Science & Applications - Year 2014, Vol 5, Issue 12
Abstract
As environmental models (such as Accelerated Climate Model for Energy (ACME), Parallel Reactive Flow and Transport Model (PFLOTRAN), Arctic Terrestrial Simulator (ATS), etc.) became more and more complicated, we are facing enormous challenges regarding to porting those applications onto hybrid computing architecture. OpenACC emerges as a very promising technology, therefore, we have conducted a feasibility analysis on porting the Community Land Model (CLM), a terrestrial ecosystem model within the Community Earth System Models (CESM)). Specifically, we used automatic function testing platform to extract a small computing kernel out of CLM, then we apply this kernel into the actually CLM dataflow procedure, and investigate the strategy of data parallelization and the benefit of data movement provided by current implementation of OpenACC. Even it is a non-intensive kernel, on a single 16-core computing node, the performance (based on the actual computation time using one GPU) of OpenACC implementation is 2.3 time faster than that of OpenMP implementation using single OpenMP thread, but it is 2.8 times slower than the performance of OpenMP implementation using 16 threads. On multiple nodes, MPI_OpenACC implementation demonstrated very good scalability on up to 128 GPUs on 128 computing nodes. This study also provides useful information for us to look into the potential benefits of “deep copy” capability and “routine” feature of OpenACC standards. We believe that our experience on the environmental model, CLM, can be beneficial to many other scientific research programs who are interested to porting their large scale scientific code using OpenACC onto high-end computers, empowered by hybrid computing architecture.
Authors and Affiliations
D. Wang, W. Wu, F. Winkler, O. Hernandez
Activity Based Learning Kits for Children in a Disadvantaged Community According to the Project “Vocational Teachers Teach Children to Create Virtuous Robots from Garbage”
This research was aimed to develop and evaluate the activity based learning kits for children in a disadvantaged community according to the project “Vocational Teachers Teach Children to Create Virtuous Robots from Garba...
Model Reference Adaptive Control Design for Nonlinear Plants
In this paper, the basic theory of the model reference adaptive control design and issues of particular relevance to control nonlinear dynamic plants with a relative degree greater than or equal to one with unknown param...
Measuring the Data Openness for the Open Data in Saudi Arabia e-Government – A Case Study
Conceptually, data can be found at the lowest level of abstraction from where information and knowledge are being extracted. Furthermore, data itself has no meaning, unless it’s being interpreted and transferred into inf...
Optimization of OADM DWDM Ring Optical Network using Various Modulation Formats
In this paper, the performance of the ring optical network is analyzed at bit rate 2.5 Gbps and 5 Gbps for various modulation formats such as NRZ rectangular, NRZ raised cosine, RZ soliton, RZ super Gaussian, RZ raised c...
The Real-Time Research of Optimal Power Flow Calculation in Reduce Active Power Loss Aspects of Power Grid
In order to research how to availably reduce the active power loss value in power grid system when the power system is operating, it offers a quantitative research in theory through conceiving the unbalanced losses of po...