Research on Operational Decision Support Based on Deep Reinforcement Learning


Abstract: On a rapidly changing battlefield, how to use intelligent technology effectively for computer-aided decision-making has become a bottleneck restricting the development of command and control technology. Through in-depth analysis of the combat decision-making process, this process is recast as a multi-step sequential decision problem. Deep learning is used to extract battlefield feature vectors that capture the decision state, including the commander's mood, behavior, and the evolution of tactics. Reinforcement learning is then used to search the state-action space of the policy and to evaluate decision states until an optimal sequence of action decisions is obtained. The aim is to gain a "machine brain versus human brain" advantage in the game on the future battlefield.
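
The formulation in the abstract (a multi-step sequential decision problem in which reinforcement learning searches the state-action space and evaluates states until a best action sequence is found) can be illustrated with a deliberately small sketch. Everything below is an assumption made for illustration and is not taken from the paper: the five-state chain environment, the reward of 1.0 at the goal, the hyperparameters, and the function names `step`, `train`, and `greedy_plan` are all hypothetical. The sketch also swaps in tabular Q-learning for deep reinforcement learning, so the deep-learning feature extraction step is replaced by a plain integer state index.

```python
import random

N_STATES = 5      # hypothetical toy states 0..4; state 4 is the goal
ACTIONS = (0, 1)  # 0 = move left, 1 = move right


def step(state, action):
    """Toy deterministic transition: returns (next_state, reward, done)."""
    if action == 0:
        nxt = max(0, state - 1)
    else:
        nxt = min(N_STATES - 1, state + 1)
    done = nxt == N_STATES - 1
    return nxt, (1.0 if done else 0.0), done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning: estimate the value of each (state, action) pair."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        for _ in range(100):  # cap on episode length
            # Explore with probability epsilon, or when both actions tie.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.choice(ACTIONS)
            else:
                action = max(ACTIONS, key=lambda a: q[state][a])
            nxt, reward, done = step(state, action)
            # One-step bootstrapped target; no future value past the goal.
            target = reward + (0.0 if done else gamma * max(q[nxt]))
            q[state][action] += alpha * (target - q[state][action])
            state = nxt
            if done:
                break
    return q


def greedy_plan(q, start=0, max_steps=20):
    """Read out an action sequence by following the highest-valued action."""
    state, plan = start, []
    for _ in range(max_steps):
        action = max(ACTIONS, key=lambda a: q[state][a])
        plan.append(action)
        state, _, done = step(state, action)
        if done:
            break
    return plan
```

Calling `greedy_plan(train())` reads the learned action sequence out of the value table from the start state. In a deep reinforcement learning setting the table `q` would be replaced by a neural network over extracted feature vectors, which this sketch does not attempt.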

Received: 2017-09-27 Published: 2017-12-25

CLC number: TP273

About the author: Zhou Lai (周来, born 1983), male, Ph.D., senior engineer. His main research interest is command and control systems.

Cite this article:

Zhou Lai (周来), Jin Xiaowei (靳晓伟), Zheng Yikai (郑益凯). Research on Operational Decision Support Based on Deep Reinforcement Learning[J]. Air & Space Defense (空天防御), 2018, 1(1): 31-35.

Link to this article:

https://www.qk.sjtu.edu.cn/ktfy/CN/ or https://www.qk.sjtu.edu.cn/ktfy/CN/Y2018/V1/I1/31
