|
|
|
| The Guidance and Control Method of Multi-Missile Cooperative Encirclement of Maneuvering Targets Based on Proximal Policy Optimization |
| ZHANG Wanying1, SIMA Ke2, ZHANG Yuhe3, MENG Jian3,
YANG Zhen3, ZHOU Deyun3 |
| 1. College of Microelectronics, Northwestern Polytechnical University, Xi’an 710072, Shaanxi, China;
2. Shanghai Electro-Mechanical Engineering Institute, Shanghai 201109, China; 3. College of
Electronics and Information, Northwestern Polytechnical University, Xi’an 710072, Shaanxi, China |
|
|
|
|
Abstract To resolve cooperative encirclement by multiple missiles against a manoeuvring target in three-dimensional space, this study proposed an impact-time-control cooperative guidance using proximal policy optimisation (PPO). Firstly, the impact-time-control cooperative guidance model was constructed based on the extended proportional guidance, and the cooperative guidance time error term was improved. Then, the state and action space models for the Markov Decision Process were designed, and the reward function was constructed as a variable-step model combining dense and sparse rewards. The cooperative guidance model was trained using PPO, mapping the guidance state information to the cooperative guidance law. Finally, a multiple-missile cooperative encirclement scenario was established, showcasing the cooperative guidance's ability to achieve model-free, end-to-end coordinated attack timing. Monte Carlo experiments further verified the robustness of its guidance in disturbed environments.
|
|
Received: 27 April 2025
Published: 10 September 2025
|
|
|
|
|
|
| [1] |
WANG Zhibo, HU Weijun, MA Xianlong, QUAN Jiale, ZHOU Haoyu. Perception-Driven-Controlled UAV Interception and Collision Technology[J]. Air & Space Defense, 2025, 8(4): 78-84. |
| [2] |
ZHOU Wenjie, FU Yulong, GUO Xiangke, QI Yutao, ZHANG Haibin. Air Combat Decision-Making Method Based on Game Tree and Digital Parallel Simulation Battlefield[J]. Air & Space Defense, 2025, 8(3): 50-58. |
| [3] |
LI Yijia, LI Jianuo, KE Liangjun. Design and Verification of UAV Cooperative Defense Strategy Based on Reinforcement Learning[J]. Air & Space Defense, 2025, 8(3): 73-85. |
| [4] |
DU Junnan, SHUAI Yixian, CHEN Ding, WANG Min, ZHOU Jinpeng. A Cooperative Deployment Algorithm for Marine Fleet Detection Nodes Based on Constrained Reinforcement Learning[J]. Air & Space Defense, 2025, 8(3): 95-103. |
| [5] |
ZHANG Yuge, GENG Jianqiang, YANG Guangyu, ZHU Supeng, HOU Zhenqian, FU Wenxing. Multi-Missile Cooperative Passive Localization Algorithm Based on IMM-SRCKF for Maneuvering Targets[J]. Air & Space Defense, 2025, 8(2): 58-65. |
| [6] |
LIU Huahua, WANG Qing. Multi-Aircraft Target Assignment Method Based on Reinforcement Learning[J]. Air & Space Defense, 2024, 7(5): 65-72. |
| [7] |
QUAN Jiale, MA Xianlong, SHEN Yuheng. Multi-agent Formation Method Based on Dynamic Optimization of Proximal Policies[J]. Air & Space Defense, 2024, 7(2): 52-62. |
| [8] |
GUO Jianguo, HU Guanjie, XU Xinpeng, LIU Yue, CAO Jin. Reinforcement Learning-Based Target Assignment Method for Many-to-Many Interceptions[J]. Air & Space Defense, 2024, 7(1): 24-31. |
| [9] |
WANG Xu, CAI Yuanli, ZHANG Xuecheng, ZHANG Rongliang, HAN Chenglong. Intercept Guidance Law with a Low Acceleration Ratio Based on Hierarchical Reinforcement Learning[J]. Air & Space Defense, 2024, 7(1): 40-47. |
| [10] |
MA Chi, ZHANG Guoqun, SUN Junge, LYU Guangzhe, ZHANG Tao. Deep Reinforcement Learning-Based Reconfiguration Method for Integrated Electronic Systems[J]. Air & Space Defense, 2024, 7(1): 63-70. |
| [11] |
LI Mengxuan, GUO Jianguo, XU Xinpeng, SHEN Yuheng. Guidance Law Based on Proximal Policy Optimization[J]. Air & Space Defense, 2023, 6(4): 51-57. |
| [12] |
LUO Tong, ZHANG Min, LIANG Chengyu. Multi-UAV Cooperative Target Tracking and Guidance Law Design[J]. Air & Space Defense, 2023, 6(3): 113-118. |
| [13] |
SUN Xinglong, MA Kemao, JIANG Yu, HOU Zhenqian. Interception Strategy Design of High Supersonic Targets in the Near Space[J]. Air & Space Defense, 2022, 5(4): 10-18. |
| [14] |
SU Shan, XIE Yongji, BAI Yulian, LIU Yintian, SHAN Yongzhi. Research on Differential Game Cooperative Confrontation Guidance Law Method[J]. Air & Space Defense, 2022, 5(2): 58-64. |
| [15] |
LIU Shuangxi, WANG Yichong, ZHU Mengjie, LI Yong, YAN Binbin. Research on Differential Game Guidance Law for Intercepting Hypersonic Vehicles with Small Missile-to-Target Speed Ratio[J]. Air & Space Defense, 2022, 5(2): 49-57. |
|
|
|
|