首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于深度图强化学习的低轨卫星网络动态路由算法
引用本文:汪昊,冉泳屹,赵雷,王俊霞,雒江涛,张涛.基于深度图强化学习的低轨卫星网络动态路由算法[J].重庆邮电大学学报(自然科学版),2023,35(4):596-605.
作者姓名:汪昊  冉泳屹  赵雷  王俊霞  雒江涛  张涛
作者单位:重庆邮电大学 通信与信息工程学院,重庆 400065
基金项目:国家自然科学基金项目(62171072,62172064,62003067);重庆市自然科学基金项目(cstc2021jcyj-msxmX0586)
摘    要:为了解决高移动性导致卫星网络路由难以计算的问题,融合图神经网络和深度强化学习,提出一种基于深度图强化学习的低轨卫星网络动态路由算法。考虑卫星网络拓扑和卫星间链路的可用带宽、传播时延等约束,构建卫星网络状态,通过图神经网络对其进行表示学习;根据此状态的图神经网络表示,深度强化学习智能体选择相应的决策动作,使卫星网络长期平均吞吐量达到最大并保证平均时延最小。仿真结果表明,所提算法在保证较小时延的同时,还能提升卫星网络吞吐量和降低丢包率。此外,图神经网络强大的泛化能力使所提算法具有更好的抗毁性能。

关 键 词:低轨卫星网络  动态路由算法  图神经网络  深度强化学习
收稿时间:2021/11/8 0:00:00
修稿时间:2023/6/10 0:00:00

Dynamic routing algorithm for LEO satellite networks based on deep graph reinforcement learning
WANG Hao,RAN Yongyi,ZHAO Lei,WANG Junxi,LUO Jiangtao,ZHANG Tao.Dynamic routing algorithm for LEO satellite networks based on deep graph reinforcement learning[J].Journal of Chongqing University of Posts and Telecommunications,2023,35(4):596-605.
Authors:WANG Hao  RAN Yongyi  ZHAO Lei  WANG Junxi  LUO Jiangtao  ZHANG Tao
Institution:School of Communications and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, P. R. China
Abstract:To solve the problem that satellite network routing is difficult to calculate due to the high mobility of satellites, we propose a dynamic routing algorithm based on deep graph reinforcement learning for LEO satellite networks fusing graph neural network and deep reinforcement learning. Considering the constraints of satellite network topology and available bandwidth and propagation delay of inter-satellite links, the satellite network state is constructed, and its representation is learned by graph neural network. Based on the graph neural network representation of this state, the deep reinforcement learning agent selects the corresponding decision action to maximize the long-term average throughput of the satellite network and ensure the minimum average delay. Simulation results show that the proposed algorithm can improve the satellite network throughput and reduce the packet loss rate while guaranteeing a lower latency. In addition, the powerful generalization capability of the graph neural network enables the proposed algorithm to have better anti-destruction performance.
Keywords:low earth orbit (LEO) satellite networks  dynamic routing algorithm  graph neural network  deep reinforcement learning
点击此处可从《重庆邮电大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《重庆邮电大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号