CASIA OpenIR

浏览/检索结果: 共61条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:34/13  |  提交时间:2024/06/25
强化学习,分层强化学习  
Learning Robust Communication by Adversarial Training in Networked System Control 期刊论文
Lecture Notes in Electrical Engineering, 2024, 页码: Chapter 52 978-981-97-3335-4
作者:  Runji, Lin;  Haifeng, Zhang
Adobe PDF(8334Kb)  |  收藏  |  浏览/下载:43/16  |  提交时间:2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning  
A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning 会议论文
, Padua, Italy, 2022年07月
作者:  Qingxu Fu;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi;  Wanmai Yuan
Adobe PDF(2650Kb)  |  收藏  |  浏览/下载:40/12  |  提交时间:2024/06/05
Learning Heterogeneous Agent Cooperation via Multiagent League Training 期刊论文
IFAC World Congress, 2023, 页码: IFAC PapersOnLine 56-2 (2023) 3033-3040
作者:  Qingxu, Fu;  Xiaolin Ai;  Jianqiang Yi;  Tenghai Qiu;  Wanmai Yuan;  Zhiqiang Pu
Adobe PDF(996Kb)  |  收藏  |  浏览/下载:40/12  |  提交时间:2024/06/05
Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning 会议论文
Advanced Intelligent Computing Technology and Applications, 中国郑州, 2023-8
作者:  He SQ(何少钦);  Gao Y(高阳);  Zhang BF(张保丰);  Chang H(常惠);  Zhang XC(张鑫辰)
Adobe PDF(1496Kb)  |  收藏  |  浏览/下载:58/21  |  提交时间:2024/05/31
Air Combat, Reinforcement Learning, Neural Fictitious Self-Play.  
Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning 会议论文
, Queensland, Australia, 2023-6
作者:  Yang, Chen;  Yang, Guangkai;  Chen, Hao;  Zhang, Junge
Adobe PDF(3027Kb)  |  收藏  |  浏览/下载:58/22  |  提交时间:2024/05/29
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文
, New Orleans, LA, USA, December 10-16, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Guangchong Zhou;  Zeren Zhang;  Guoliang Fan
Adobe PDF(8700Kb)  |  收藏  |  浏览/下载:53/15  |  提交时间:2024/05/28
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Zeren Zhang;  Guangchong Zhou;  Hao Chen;  Guoliang Fan
Adobe PDF(4141Kb)  |  收藏  |  浏览/下载:45/18  |  提交时间:2024/05/28
MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Shenzhen, China, 18-22 July 2021
作者:  Zhiwei Xu;  Dapeng Li;  Yunpeng Bai;  Guoliang Fan
Adobe PDF(3892Kb)  |  收藏  |  浏览/下载:21/11  |  提交时间:2024/05/28
A Performance Optimization Strategy Based on Improved NSGA-II for a Flexible Robotic Fish 会议论文
, 英国伦敦, 2023.5.29
作者:  Lu, Ben;  Wang, Jian;  Liao, Xiaocun;  Zou, Qianqian;  Tan, Min;  Zhou, Chao
Adobe PDF(1449Kb)  |  收藏  |  浏览/下载:65/17  |  提交时间:2024/05/28