基于多算法框架自适应层级共享的海上联合防空多智能体训练研究

Abstract
Figure/Table
References
Related Citation (3)

Download: PDF (1742 KB) (1 KB)
Export: BibTeX | EndNote (RIS)

Abstract Multi-agent training is widely applied in modern maritime joint air defence operations. When two opposing parties independently choose different learning algorithms, it is often necessary to ensure implementation consistency and to manage resource allocation. To achieve this, a cross-agent shared fully connected layer is typically employed as a common representational foundation. However, a static choice between “fully shared” and “fully private” structures fails to balance policy consistency with individual differentiation. At the same time, discrepancies in behaviour distribution and gradient statistics across algorithms may amplify negative transfer and training variance. To overcome the above challenges, this paper introduced adaptive layer sharing (ALS) across heterogeneous algorithms, enabling learnable gating mechanisms to dynamically weight between shared and private branches at every layer. In a small-scale, single-machine experimental setup, standardized and reproducible protocols were established to record and report game outcomes and compliance. When ALS was activated, the learned gating distributions and thresholded topologies would be extracted, creating an implementable and observable engineering baseline that provides a clear structural and metric foundation for future large-scale and multi-task evaluations.

Key words： maritime joint air defence multi-agent systems proximal policy optimization deep deterministic policy gradient shared encoder

Received: 30 October 2025 Published: 13 January 2026

ZTFLH:

V 57

	Service

	E-mail this article
	Add to my bookshelf
	Add to citation manager
	E-mail Alert
	RSS
	Articles by authors

Cite this article:

URL:

https://www.qk.sjtu.edu.cn/ktfy/EN/ OR https://www.qk.sjtu.edu.cn/ktfy/EN/Y2025/V8/I6/121