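An Improved Algorithm for Policy Optimization in the Dynamic Weapon Target Assignment Problem
Cite this article: CHEN Ying-wu, CAI Huai-ping, XING Li-ning. An Improved Algorithm for Policy Optimization in the Dynamic Weapon Target Assignment Problem[J]. Systems Engineering - Theory & Practice (系统工程理论与实践), 2007, 27(7): 160-165. DOI: 10.12011/1000-6788(2007)7-160
Authors: CHEN Ying-wu  CAI Huai-ping  XING Li-ning
Affiliations: 1. College of Information Systems and Management, National University of Defense Technology, Changsha 410073
2. College of Information Systems and Management, National University of Defense Technology, Changsha 410073; Unit 95851 of the Chinese People's Liberation Army, Nanjing 210046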
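Abstract: The target-selection policy problem in dynamic weapon target assignment (WTA) can be studied by building a Markov decision process (MDP) model, but no effective algorithm currently exists for finding the optimal policy of MDP problems of this larger scale. By analyzing the characteristics of the MDP model of the dynamic WTA problem, an improved algorithm for finding the optimal policy is given. The improvements lie mainly in the rule for selecting the initial policy, the policy improvement rule, and the criterion for judging whether a policy is optimal. The algorithm requires little computation, saves memory, and still obtains the optimal solution. Finally, the algorithm is compared with the traditional algorithm on a numerical example. The improved algorithm can be used to solve policy optimization problems in larger-scale dynamic WTA.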

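Keywords: operations research  dynamic weapon target assignment  algorithm  policy optimization  Markov decision process
Article ID: 1000-6788(2007)07-0160-06
Revised manuscript received: 2005-12-30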

An Improved Algorithm of Policies Optimization of Dynamic Weapon Target Assignment Problem
CHEN Ying-wu, CAI Huai-ping, XING Li-ning. An Improved Algorithm of Policies Optimization of Dynamic Weapon Target Assignment Problem[J]. Systems Engineering - Theory & Practice, 2007, 27(7): 160-165. DOI: 10.12011/1000-6788(2007)7-160
Authors: CHEN Ying-wu  CAI Huai-ping  XING Li-ning
Abstract: The policy optimization problem in dynamic weapon target assignment (WTA) can be modeled as a Markov decision process (MDP); however, no effective algorithm currently exists for finding the optimal policies of such large-scale problems. The characteristics of the MDP model are analyzed, and an improved algorithm for finding the optimal policy is proposed accordingly. The improvements lie mainly in the rule for selecting the initial policy, the policy improvement rule, and the criterion for judging whether a policy is optimal, so both storage space and computing time are reduced while the optimal solution of the MDP can still be obtained. Finally, the improved algorithm is compared with the conventional algorithm on an example. The improved algorithm is suitable for large-scale problems such as policy optimization in dynamic WTA.
Keywords: operations research  dynamic weapon target assignment  algorithm  policy optimization  Markov decision process (MDP)  mathematical model
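
The abstract contrasts the improved algorithm with a conventional one but gives neither. For orientation only, the sketch below shows textbook policy iteration, which is one plausible reading of the "conventional algorithm" and is an assumption, not something the abstract confirms. It has the same three ingredients the abstract names: an initial policy, a policy improvement rule, and an optimality test. It does not implement the authors' improvements. The two-state toy problem (one target, two weapon types), its kill probabilities, costs, rewards, and discount factor are all invented here for illustration.

```python
from typing import Dict, List, Tuple

# transitions[state][action] = list of (probability, next_state, reward).
# This layout is a convenience of the sketch, not the paper's notation.
Transitions = Dict[int, Dict[int, List[Tuple[float, int, float]]]]


def q_value(P: Transitions, V: Dict[int, float], s: int, a: int, gamma: float) -> float:
    """Expected discounted return of taking action a in state s, then following V."""
    return sum(p * (r + gamma * V[s2]) for p, s2, r in P[s][a])


def evaluate_policy(P: Transitions, policy: Dict[int, int], gamma: float,
                    tol: float = 1e-10) -> Dict[int, float]:
    """Iterative policy evaluation: sweep until the values stop changing."""
    V = {s: 0.0 for s in P}
    while True:
        delta = 0.0
        for s in P:
            v_new = q_value(P, V, s, policy[s], gamma)
            delta = max(delta, abs(v_new - V[s]))
            V[s] = v_new
        if delta < tol:
            return V


def policy_iteration(P: Transitions, gamma: float = 0.9):
    # Initial policy: arbitrarily pick the lowest-numbered action in each state.
    policy = {s: min(P[s]) for s in P}
    while True:
        V = evaluate_policy(P, policy, gamma)
        stable = True
        for s in P:
            # Improvement rule: switch to the greedy action if it is strictly better.
            best = max(P[s], key=lambda a: q_value(P, V, s, a, gamma))
            if q_value(P, V, s, best, gamma) > q_value(P, V, s, policy[s], gamma) + 1e-9:
                policy[s] = best
                stable = False
        # Optimality test: no state changed its action in a full sweep.
        if stable:
            return policy, V


# Hypothetical toy problem: state 1 = target alive, state 0 = target destroyed.
# Action 0 = cheap weapon (kill prob 0.5, cost 1); action 1 = better weapon
# (kill prob 0.8, cost 2). Destroying the target earns 10. All numbers are made up.
toy_mdp: Transitions = {
    0: {0: [(1.0, 0, 0.0)]},
    1: {
        0: [(0.5, 0, 9.0), (0.5, 1, -1.0)],
        1: [(0.8, 0, 8.0), (0.2, 1, -2.0)],
    },
}

policy, values = policy_iteration(toy_mdp, gamma=0.9)
print(policy, values)
```

Each pass evaluates the whole policy over every state, which is the cost that grows with problem size and that the abstract says the improved algorithm reduces.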
This article is indexed in CNKI, Wanfang Data, and other databases.