首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于CUDA的GPU条件分支分歧聚合优化策略
引用本文:刘素芹,王 鑫,安仲奇,杨娜利,王俊爽.基于CUDA的GPU条件分支分歧聚合优化策略[J].中国石油大学学报(自然科学版),2014(3):174-180.
作者姓名:刘素芹  王 鑫  安仲奇  杨娜利  王俊爽
作者单位:中国石油大学计算机与通信工程学院;
基金项目:中央高校基本科研业务费专项(09CX04061A)
摘    要:分析NVIDIA GPU底层处理SIMD条件分支分歧的方式及其对程序性能产生的影响。在软件层级提出两种利用"聚合"思想的SIMD条件分支分歧优化策略:循环推迟和循环提前。策略将不同SIMD道中选择相同路径的条件分支"聚合"到同一步循环中,减少了SIMD操作的实际次数。使用CUDA对这两种策略进行的试验结果表明,在满足策略使用条件的前提下能够取得预想中的加速比。该策略实现难度较低、可操作性较强。

关 键 词:SIMD  条件分支分歧  聚合  循环推迟  循环提前
收稿时间:2013/6/28 0:00:00

GPU conditional branch divergence converging optimization strategies based on CUDA
LIU Su-qin,WANG Xin,AN Zhong-qi,YANG Na-li and WANG Jun-shuang.GPU conditional branch divergence converging optimization strategies based on CUDA[J].Journal of China University of Petroleum,2014(3):174-180.
Authors:LIU Su-qin  WANG Xin  AN Zhong-qi  YANG Na-li and WANG Jun-shuang
Institution:LIU Su-qin;WANG Xin;AN Zhong-qi;YANG Na-li;WANG Jun-shuang;College of Computer Science and Technology in China University of Petroleum;
Abstract:The underlying rules how the NVIDIA GPU deals single instruction multiple data (SIMD) conditional branch divergence and its impact on application performance were analyzed. Based on loop postpone or loop advance, two SIMD conditional branch divergence optimization strategies were proposed, in which conditional branches that choose the same path in different SIMD lanes were merged into one loop step, resulting in reducing the number of SIMD operations. The experimental results of these two strategies by using CUDA show that the application achieves expected speedup when the conditions of applying these strategies are met. These strategies are less difficult to implement and have a strong operability.
Keywords:single instruction multiple data(SIMD)  conditional branch divergence  converging  loop postpone  loop advance
本文献已被 CNKI 等数据库收录!
点击此处可从《中国石油大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《中国石油大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号