Paper Title

Improved Multi-GPU parallelization of a Lagrangian Transport Model

Author

Bolarinwa, Saheed

Abstract

This report describes our work on improving GPU parallelization by supporting compute nodes with multiple GPUs. Because default support for multiple GPUs in OpenACC is limited [6], the current implementation allows each MPI process to access only a single GPU. The only way to take full advantage of multi-GPU nodes in the current version is therefore to launch multiple processes, which increases resource contention. We investigated the benefits of having a single process offload to all available GPU devices.
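As a rough illustration of the single-process approach, the sketch below shows one way a host process might divide a work array into contiguous index ranges, one per GPU. It is not taken from the report: the function name `split_range` and the even, contiguous decomposition are assumptions made for illustration. In an OpenACC code, the host would typically select each device in turn (for example with the runtime routine `acc_set_device_num`) before offloading its range.

```python
def split_range(n_items: int, n_devices: int) -> list[tuple[int, int]]:
    """Split n_items work items into one half-open [start, stop) range per device.

    Hypothetical helper for illustration only; the report does not specify
    how work is divided among GPUs.
    """
    if n_devices < 1:
        raise ValueError("n_devices must be at least 1")
    # Every device gets `base` items; the first `extra` devices get one more.
    base, extra = divmod(n_items, n_devices)
    ranges = []
    start = 0
    for dev in range(n_devices):
        size = base + (1 if dev < extra else 0)
        ranges.append((start, start + size))
        start += size
    return ranges


if __name__ == "__main__":
    # One host process walks over all devices instead of one MPI rank per GPU.
    for dev, (start, stop) in enumerate(split_range(10, 4)):
        print(f"device {dev}: items [{start}, {stop})")
```

Chunk sizes differ by at most one item, so no device is left with a noticeably larger share when the item count is not a multiple of the device count.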
