首页 | 本学科首页   官方微博 | 高级检索  
     

基于强化学习的多机协同传感器管理
引用本文:闫实,贺静,王跃东,孙自强,梁彦. 基于强化学习的多机协同传感器管理[J]. 系统工程与电子技术, 2020, 42(8): 1726-1733. DOI: 10.3969/j.issn.1001-506X.2020.08.12
作者姓名:闫实  贺静  王跃东  孙自强  梁彦
作者单位:1. 西北工业大学自动化学院, 陕西 西安 7100722. 信息融合教育部重点实验室, 陕西 西安 7100723. 南京电子技术研究所, 江苏 南京 210039
基金项目:国家自然科学基金(61771399);国家自然科学基金(61873205)
摘    要:网络化战争中,机载雷达在实现对目标信息持续获取的同时保证载机安全生存是亟待解决的问题。对此,以多机协同作战安全转场任务为背景,提出基于深度强化学习算法的智能传感器管理方法。首先,综合考虑信号辐射量与目标威胁因素,计算目标运动过程中的实时威胁隶属度。其次,在强化学习框架下对雷达-目标分派问题建模,利用神经网络逼近动作-值函数,并根据时序差分算法进行参数更新。仿真结果表明,相比于传统调度方法,所提算法有效提升了任务成功率,缩短了任务完成用时。

关 键 词:传感器管理  强化学习  威胁隶属度  
收稿时间:2020-01-13

Multi-airborne cooperative sensor management based on reinforcement learning
Shi YAN,Jing HE,Yuedong WANG,Ziqiang SUN,Yan LIANG. Multi-airborne cooperative sensor management based on reinforcement learning[J]. System Engineering and Electronics, 2020, 42(8): 1726-1733. DOI: 10.3969/j.issn.1001-506X.2020.08.12
Authors:Shi YAN  Jing HE  Yuedong WANG  Ziqiang SUN  Yan LIANG
Affiliation:1. School of Automation, Northwestern Polytechnical University, Xi'an 710072, China2. Key Laboratory of Information Fusion, Ministry of Education, Xi'an 710072, China3. Nanjing Institute of Electronic Technology, Nanjing 210039, China
Abstract:In the networked war, it is urgent that airborne radar can continuously acquire target information while ensuring the safe survival. Focusing on this problem, in the context of safe transition tasks of multi-airborne cooperative operations, this paper proposes a intelligent sensor management method based on deep reinforcement learning. First, the real-time threat membership is calculated considering the signal radiation and several threat factors. Then, the radar-target assignment problem is modeled in a reinforcement learning framework. The neural network is used to approximate the action-value function, and the parameters are updated according to the temporal-difference algorithm. It can be seen from the simulation that the proposed algorithm improves the task success rate and shortens the time of task completion compared with the traditional scheduling methods.
Keywords:sensor management  reinforcement learning  threat membership  
点击此处可从《系统工程与电子技术》浏览原始摘要信息
点击此处可从《系统工程与电子技术》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号