首页 | 本学科首页   官方微博 | 高级检索  
     检索      

异构内存系统全局优化的数据预取算法
引用本文:裴颂文,赵梦旖,姬燕飞.异构内存系统全局优化的数据预取算法[J].上海理工大学学报,2019,41(1):22-29.
作者姓名:裴颂文  赵梦旖  姬燕飞
作者单位:上海理工大学光电信息与计算机工程学院;复旦大学管理学院
基金项目:中国博士后科学基金资助项目(2017M610230);国家自然科学基金资助项目(61775139,61332009);上海市自然科学基金资助项目(15ZR1428600);上海市浦江人才计划项目(PJ1407600)
摘    要:鉴于现有的数据预取算法不能满足高效能异构计算系统对动态随机存取存储器(DRAM)和非易失性存储器(NVM)相结合的新型异构存储器高效访问的要求,提出了一种模拟退火的全局优化数据预取算法(SADPA)。该算法在启发式搜索模拟退火算法的基础上,引入了随机因子,以避免局部最优,从而确定了全局优化阈值以预取NVM页面的有效数量。实验结果表明,该算法相对于静态阈值调整算法,平均访问延时降低了4%,每个时钟周期内的平均指令数(IPC)增加了10.1%;对于cactusADM应用,该算法相对于软硬件协同的动态阈值调整算法,系统能耗降低了3.4%。

关 键 词:异构内存系统  数据预取  模拟退火算法  全局优化
收稿时间:2018/1/27 0:00:00

Data Prefetching Algorithm for Globally Optimizing Heterogeneous Memory System
PEI Songwen,ZHAO Mengyi and JI Yanfei.Data Prefetching Algorithm for Globally Optimizing Heterogeneous Memory System[J].Journal of University of Shanghai For Science and Technology,2019,41(1):22-29.
Authors:PEI Songwen  ZHAO Mengyi and JI Yanfei
Institution:School of Optical-Electrical and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093,China;School of Management, Fudan University, Shanghai 200433, China,School of Optical-Electrical and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093,China and School of Optical-Electrical and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093,China
Abstract:Due to the existing data prefetching algorithms can''t meet the requirements of the novel heterogeneous memory system combining the dynamic random access memory (DRAM) with the non-volatile memory (NVM) in high energy-efficiency heterogeneous computing systems, a simulated annealing data prefetching algorithm (SADPA) was proposed. It was a heuristic search inspired simulated annealing algorithm, in which a random factor was introduced to confirm the global optimal threshold and the valid number of prefetching NVM pages. The results show that the average accessing latency of SADPA is 4% lower than that of the static threshold adjustment algorithm, and the average instruction per cycle (IPC) of the SADPA is 10.1% greater than that of the static threshold adjustment algorithm. Besides, the systemic power supported by SADPA, as for the cactusADM, is reduced by 3.4% compared with the cooperative hardware/software dynamic threshold adjustment algorithm.
Keywords:heterogeneous memory system  data prefetching  simulated annealing algorithm  global optimum
本文献已被 CNKI 等数据库收录!
点击此处可从《上海理工大学学报》浏览原始摘要信息
点击此处可从《上海理工大学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号