首页 | 本学科首页   官方微博 | 高级检索  
     检索      

LTE-V下基于深度强化学习的基站选择算法
引用本文:谢浩,郭爱煌,宋春林,焦润泽.LTE-V下基于深度强化学习的基站选择算法[J].系统工程与电子技术,2019,41(7):1652-1657.
作者姓名:谢浩  郭爱煌  宋春林  焦润泽
作者单位:1. 同济大学电子与信息工程学院, 上海 201804;; 2. 东南大学毫米波国家重点实验室, 江苏 南京 201804;
基金项目:毫米波国家重点实验室开放项目(K201935)资助课题
摘    要:针对长期演进-车辆(long term evolution-vehicle, LTE-V)下的车辆随机竞争接入网络容易造成网络拥塞的问题,提出基于深度强化学习(deep reinforcement learning,DRL)为LTE-V下的车辆接入最佳基站(evolved node B,eNB)的选择算法。使用LTE核心网中移动管理单元(mobility management entity,MME)作为代理,同时考虑网络侧负载与接收端接收速率,完成车辆与eNB的匹配问题,降低网络拥塞概率,减少网络时延。使用竞争-双重深度Q网络(dueling-double deep Q-network,D-DDQN)来拟合目标动作-估值函数(action -value function,AVF),完成高维状态输入-低维动作输出的转化。仿真表明,D-DDQN训练完成参数收敛后,LTE-V网络拥塞概率大幅下降,整体性能有较大提升。

关 键 词:长期演进-车辆  深度强化学习  基站选择  拥塞概率  网络负载均衡

eNB selection for LTE-V using deep reinforcement learning
XIE Hao,GUO Aihuang,SONG Chunlin,JIAO Runze.eNB selection for LTE-V using deep reinforcement learning[J].System Engineering and Electronics,2019,41(7):1652-1657.
Authors:XIE Hao  GUO Aihuang  SONG Chunlin  JIAO Runze
Institution:1. School of Electronics and Information Engineering, Tongji University, Shanghai 201804, China;; 2. State Key Laboratory of Millimeter Waves, Southeast University, Nanjing 210092, China;
Abstract:The source allocation scheme for long term evolution-vehicle (LTE-V) is based on random selection, which will cause serious network congestion easily. Based on deep reinforcement learning (DRL), an best access evolved node B (eNB) selection algorithm for the vehicle type communication under LTE-V network is proposed. In order to reduce both the blocking probability and communication delays of LTE-V network, the mobility management entity (MME) is used as an agent, also the receiving rate at user side and network loading at network side are taking into consideration. Meanwhile, dueling-double deep Q-network (D-DDQN) is adopt to fit the target action-value function (AVF). D-DDQN can convert the high dimension state inputs to the low dimension action outputs. The simulation shows that the blocking probability of LTE-V network is reduced significantly after the convergence of DQN’s parameters and the properties of the entire network is improved greatly.
Keywords:long term evolution-vehicle (LTE-V)  deep reinforcement learning (DRL)  evolved node B (eNB) selection  network blocking probability  load balance
本文献已被 维普 等数据库收录!
点击此处可从《系统工程与电子技术》浏览原始摘要信息
点击此处可从《系统工程与电子技术》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号