Performance modeling of 3D MPDATA simulations on GPU cluster
详细信息    查看全文
  • 作者:Krzysztof Rojek ; Roman Wyrzykowski
  • 关键词:EULAG ; MPDATA ; Stencils ; GPU cluster ; MPI ; Performance model
  • 刊名:The Journal of Supercomputing
  • 出版年:2017
  • 出版时间:February 2017
  • 年:2017
  • 卷:73
  • 期:2
  • 页码:664-675
  • 全文大小:660KB
  • 刊物类别:Computer Science
  • 刊物主题:Programming Languages, Compilers, Interpreters; Processor Architectures; Computer Science, general;
  • 出版者:Springer US
  • ISSN:1573-0484
  • 卷排序:73
文摘
The goal of this study is to parallelize the multidimensional positive definite advection transport algorithm (MPDATA) across a computational cluster equipped with GPUs. Our approach permits us to provide an extensive overlapping GPU computations and data transfers, both between computational nodes, as well as between the GPU accelerator and CPU host within a node. For this aim, we decompose a computational domain into two unequal parts which correspond to either data dependent or data independent parts. Then, data transfers can be performed simultaneously with computations corresponding to the second part. Our approach allows for achieving 16.372 Tflop/s using 136 GPUs. To estimate the scalability of the proposed approach, a performance model dedicated to MPDATA simulations is developed. We focus on the analysis of computation and communication execution times, as well as the influence of overlapping data transfers and GPU computations, with regard to the number of nodes.
NGLC 2004-2010.National Geological Library of China All Rights Reserved.
Add:29 Xueyuan Rd,Haidian District,Beijing,PRC. Mail Add: 8324 mailbox 100083
For exchange or info please contact us via email.