基于强化学习的多机协同传感器管理 Multi-airborne cooperative sensor management based on reinforcement learning期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于强化学习的多机协同传感器管理

引用本文：	闫实,贺静,王跃东,孙自强,梁彦. 基于强化学习的多机协同传感器管理[J]. 系统工程与电子技术, 2020, 42(8): 1726-1733. DOI: 10.3969/j.issn.1001-506X.2020.08.12

作者姓名：	闫实贺静王跃东孙自强梁彦

作者单位：	1. 西北工业大学自动化学院, 陕西西安 7100722. 信息融合教育部重点实验室, 陕西西安 7100723. 南京电子技术研究所, 江苏南京 210039

基金项目：	国家自然科学基金(61771399);国家自然科学基金(61873205)

摘要：	网络化战争中,机载雷达在实现对目标信息持续获取的同时保证载机安全生存是亟待解决的问题。对此,以多机协同作战安全转场任务为背景,提出基于深度强化学习算法的智能传感器管理方法。首先,综合考虑信号辐射量与目标威胁因素,计算目标运动过程中的实时威胁隶属度。其次,在强化学习框架下对雷达-目标分派问题建模,利用神经网络逼近动作-值函数,并根据时序差分算法进行参数更新。仿真结果表明,相比于传统调度方法,所提算法有效提升了任务成功率,缩短了任务完成用时。
关键词：	传感器管理强化学习威胁隶属度
收稿时间：	2020-01-13
Multi-airborne cooperative sensor management based on reinforcement learning

Shi YAN,Jing HE,Yuedong WANG,Ziqiang SUN,Yan LIANG. Multi-airborne cooperative sensor management based on reinforcement learning[J]. System Engineering and Electronics, 2020, 42(8): 1726-1733. DOI: 10.3969/j.issn.1001-506X.2020.08.12

Authors:	Shi YAN Jing HE Yuedong WANG Ziqiang SUN Yan LIANG

Affiliation:	1. School of Automation, Northwestern Polytechnical University, Xi'an 710072, China2. Key Laboratory of Information Fusion, Ministry of Education, Xi'an 710072, China3. Nanjing Institute of Electronic Technology, Nanjing 210039, China

Abstract:	In the networked war, it is urgent that airborne radar can continuously acquire target information while ensuring the safe survival. Focusing on this problem, in the context of safe transition tasks of multi-airborne cooperative operations, this paper proposes a intelligent sensor management method based on deep reinforcement learning. First, the real-time threat membership is calculated considering the signal radiation and several threat factors. Then, the radar-target assignment problem is modeled in a reinforcement learning framework. The neural network is used to approximate the action-value function, and the parameters are updated according to the temporal-difference algorithm. It can be seen from the simulation that the proposed algorithm improves the task success rate and shortens the time of task completion compared with the traditional scheduling methods.

Keywords:	sensor management reinforcement learning threat membership

	点击此处可从《系统工程与电子技术》浏览原始摘要信息
	点击此处可从《系统工程与电子技术》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏