×
模态框(Modal)标题
在这里添加一些文本
Close
Close
Submit
Cancel
Confirm
×
模态框(Modal)标题
×
Search
Citation
Fig/Tab
Adv Search
Home
中文
Reward Function Design Method for Long Episode Pursuit Tasks Under Polar Coordinate in Multi-Agent Reinforcement Learning
DONG Yubo
1
(董玉博), CUI Tao
1
(崔涛), ZHOU Yufan
1
(周禹帆), SONG Xun
2
(宋勋), ZHU Yue
2
(祝月), DONG Peng
1∗
(董鹏)
J Shanghai Jiaotong Univ Sci . 2024, (
4
): 646 -655 . DOI: 10.1007/s12204-024-2713-4
Copyright © 2015 Journal of Shanghai Jiaotong University (Agricultural Sciences), All Rights Reserved.
Powered by Beijing Magtech Co. Ltd