宏微网络中基于后决策状态学习的基站关断节能策略 Post-decision state learning based on/off strategies for energy saving in heterogeneous networks期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

宏微网络中基于后决策状态学习的基站关断节能策略

引用本文：	陈端云,夏炳森,陈卓琳,王威丽.宏微网络中基于后决策状态学习的基站关断节能策略[J].重庆邮电大学学报(自然科学版),2021,33(4):536-545.

作者姓名：	陈端云夏炳森陈卓琳王威丽

作者单位：	国网福建省电力有限公司,福州350003;国网福建省电力有限公司经济技术研究院,福州350012;重庆邮电大学通信与信息工程学院,重庆400065

基金项目：	国家自然科学基金（61571073）；国网福建省电力有限公司2019年度科研项目（52130N18000U）

摘要：	为了根据网络的业务状态动态地调整基站的开关状态以在保证用户服务质量的同时降低宏微网络的能量消耗,提出将时延限制下长期平均能耗的最小化问题映射为受限马尔科夫决策过程(constrained Markov decision process,CMDP).在该过程中把网络中每个基站的用户数目定义为系统状态,将每个小基站的开/关操作定义为网络行动.为了充分利用网络已知的先验知识加快学习速度,采用后决策状态学习算法在线更新网络的开/关策略.该算法可根据关断策略执行前已知的网络状态学习关断策略执行后未知的网络状态,从而快速制定出相应的关断策略.理论验证了后决策状态学习算法的收敛性.通过仿真对比可知,后决策状态学习算法不仅学习速度快,且可在保证网络服务质量的同时收敛到最优的基站关断节能策略.
关键词：	宏微网络节能关断策略后决策状态学习
收稿时间：	2019/11/25 0:00:00
修稿时间：	2021/3/12 0:00:00
Post-decision state learning based on/off strategies for energy saving in heterogeneous networks

CHEN Duanyun,XIA Bingsen,CHEN Zhuolin,WANG Weili.Post-decision state learning based on/off strategies for energy saving in heterogeneous networks[J].Journal of Chongqing University of Posts and Telecommunications,2021,33(4):536-545.

Authors:	CHEN Duanyun XIA Bingsen CHEN Zhuolin WANG Weili

Institution:	State Grid Fujian Electric Power Company, Fuzhou 350003, P. R. China;Economic Technology Research Institute, State Grid Fujian Electric Power Company, Fuzhou 350012, P. R. China; College of Information and Communication Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, P. R. China

Abstract:	To adjust the on/off states of base stations (BSs) dynamically based on the network load for reducing the energy consumption of the heterogeneous networks while ensuring the quality of service (QoS) of users, this paper maps the problem about the minimization of long-term average energy consumption under delay limit to a constrained Markov decision process (CMDP), where the user numbers of all BSs are defined as system states and the switching operations of all SBSs are defined as network actions. To take full advantage of the network prior knowledge to speed up the learning rate, this paper adopts the post-decision state learning algorithm to update on/off strategies online. This algorithm will learn the unknown network states after implementing on/off strategies based on the known network states, and then the proper actions can be determined. Besides, the convergence of the post-decision state learning algorithm is theoretically verified. The comparative simulation demonstrates that the post-decision state learning algorithm not only learns faster, but can converge to the best on/off strategies while guaranteeing the service quality of the networks.

Keywords:	heterogeneous networks energy-saving on/off strategies post-decision state learning
本文献已被万方数据等数据库收录！
	点击此处可从《重庆邮电大学学报(自然科学版)》浏览原始摘要信息
	点击此处可从《重庆邮电大学学报(自然科学版)》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏