首页 | 本学科首页   官方微博 | 高级检索  
     

基于马尔可夫的多功能雷达认知干扰决策建模研究
引用本文:朱霸坤,朱卫纲,李伟,杨莹,高天昊. 基于马尔可夫的多功能雷达认知干扰决策建模研究[J]. 系统工程与电子技术, 2022, 44(8): 2488-2497. DOI: 10.12305/j.issn.1001-506X.2022.08.13
作者姓名:朱霸坤  朱卫纲  李伟  杨莹  高天昊
作者单位:1. 航天工程大学电子光学工程系, 北京 1014162. 电子信息系统复杂电磁环境效应国家重点实验室, 河南 洛阳 4710323. 航天工程大学研究生院, 北京 101416
基金项目:CEMEE国家重点实验室项目(CEMEE2020Z0203B)
摘    要:多功能雷达是现代电磁战场上不可或缺的重要装备, 针对多功能雷达的干扰一直是一个难题。本文在研究多功能雷达信号特点和雷达对抗过程的基础上, 提出了雷达状态联合表征的方法, 将多功能雷达的干扰决策问题建模为一个带收益的马尔可夫决策过程, 设计了认知干扰决策系统, 并通过基于Q-Learning的认知干扰决策算法求解该模型下的最佳干扰策略。通过仿真实验, 证明了基于Q-Learning的认知干扰决策算法能够在缺乏先验经验的情况下学习到最佳干扰策略, 具备“认知”的特性, 并且在不稳定的环境中也具有较强的适应性, 有效支撑了本文所提的干扰决策模型。

关 键 词:雷达对抗  马尔可夫决策过程  雷达状态  强化学习  Q-Learning  
收稿时间:2021-06-01

Research on decision-making modeling of cognitive jamming for multi-functional radar based on Markov
Bakun ZHU,Weigang ZHU,Wei LI,Ying YANG,Tianhao GAO. Research on decision-making modeling of cognitive jamming for multi-functional radar based on Markov[J]. System Engineering and Electronics, 2022, 44(8): 2488-2497. DOI: 10.12305/j.issn.1001-506X.2022.08.13
Authors:Bakun ZHU  Weigang ZHU  Wei LI  Ying YANG  Tianhao GAO
Affiliation:1. Department of Electronic and Optical Engineering, Space Engineering University, Beijing 101416, China2. State Key Laboratory of Complex Electromagnetic Environment Effects on Electronics and Information System, Luoyang 471032, China3. Graduate School, Space Engineering University, Beijing 101416, China
Abstract:Multi-functional radar is an indispensable and important equipment in modern electromagnetic battlefield. The interference of multi-functional radar is always a difficult problem. In this paper, based on the study of the characteristics of multi-functional radar signal and the radar countermeasure process, the method of joint representation of radar state is proposed, and the interference problem of multi-functional radar is modeled as a Markov decision process with benefits. The cognitive interference decision system is designed. The interference strategy is solved by the cognitive interference decision algorithm based on Q-learning. Through the simulation experiment, it is proved that the cognitive interference decision algorithm based on Q-learning can learn the optimal interference strategy in the absence of prior experience, have the characteristic of cognition, and have strong adaptability in the unstable environment, which effectively supports the interference decision model mentioned above.
Keywords:radar confrontation  Markov decision process  radar state  reinforcement learning  Q-learning  
点击此处可从《系统工程与电子技术》浏览原始摘要信息
点击此处可从《系统工程与电子技术》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号