CASIA OpenIR

浏览/检索结果: 共12条,第1-10条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:10/5  |  提交时间:2024/06/25
强化学习,分层强化学习  
Embed Trajectory Imitation in Reinforcement Learning: A Hybrid Method for Autonomous Vehicle Planning 会议论文
/, Orlando, FL, USA, 2023-11
作者:  Wang, Yuxiao;  Dai, Xingyuan;  Wang, Kara;  Ali, Hub;  Zhu, Fenghua
Adobe PDF(1410Kb)  |  收藏  |  浏览/下载:22/6  |  提交时间:2024/06/11
Imitation Learning  Trajectory Planning  Deep Reinforcement Learning  Autonomous Driving  
Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning 会议论文
Advanced Intelligent Computing Technology and Applications, 中国郑州, 2023-8
作者:  He SQ(何少钦);  Gao Y(高阳);  Zhang BF(张保丰);  Chang H(常惠);  Zhang XC(张鑫辰)
Adobe PDF(1496Kb)  |  收藏  |  浏览/下载:41/12  |  提交时间:2024/05/31
Air Combat, Reinforcement Learning, Neural Fictitious Self-Play.  
Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning 会议论文
, Queensland, Australia, 2023-6
作者:  Yang, Chen;  Yang, Guangkai;  Chen, Hao;  Zhang, Junge
Adobe PDF(3027Kb)  |  收藏  |  浏览/下载:41/16  |  提交时间:2024/05/29
Efficient Hierarchical Reinforcement Learning via Mutual Information Constrained Subgoal Discovery 会议论文
, 长沙, 2023-11
作者:  Kaishen Wang;  Jingqing Ruan;  Qingyang Zhang;  Dengpeng Xing
Adobe PDF(2044Kb)  |  收藏  |  浏览/下载:23/13  |  提交时间:2024/05/28
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文
, New Orleans, LA, USA, December 10-16, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Guangchong Zhou;  Zeren Zhang;  Guoliang Fan
Adobe PDF(8700Kb)  |  收藏  |  浏览/下载:29/7  |  提交时间:2024/05/28
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Zeren Zhang;  Guangchong Zhou;  Hao Chen;  Guoliang Fan
Adobe PDF(4141Kb)  |  收藏  |  浏览/下载:26/8  |  提交时间:2024/05/28
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Yunpeng Bai;  Bin Zhang;  Dapeng Li;  Guoliang Fan
Adobe PDF(3345Kb)  |  收藏  |  浏览/下载:25/6  |  提交时间:2024/05/28
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文
Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14
作者:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:192/42  |  提交时间:2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Subspace-Aware Exploration for Sparse-Reward Multi-Agent Tasks 会议论文
, Washington DC, USA, 2023-2-7
作者:  Pei Xu;  Junge Zhang;  Qiyue Yin;  Chao Yu;  Yaodong Yang;  Kaiqi Huang
Adobe PDF(2037Kb)  |  收藏  |  浏览/下载:240/73  |  提交时间:2023/06/19
deep reinforcement learning  sparse reward  exploration  multi-agent