CASIA OpenIR

浏览/检索结果: 共8条,第1-8条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:29/13  |  提交时间:2024/06/25
Bidirectional Sentence Ordering with Interactive Decoding 期刊论文
ACM Transactions on Asian and Low-Resource Language Information Processing, 2023, 卷号: 22, 期号: 2, 页码: 1-15
作者:  Guirong Bai;  Shizhu HE;  Kang Liu;  Jun Zhao
Adobe PDF(1080Kb)  |  收藏  |  浏览/下载:44/16  |  提交时间:2024/06/20
Learning Heterogeneous Agent Cooperation via Multiagent League Training 期刊论文
IFAC World Congress, 2023, 页码: IFAC PapersOnLine 56-2 (2023) 3033-3040
作者:  Qingxu, Fu;  Xiaolin Ai;  Jianqiang Yi;  Tenghai Qiu;  Wanmai Yuan;  Zhiqiang Pu
Adobe PDF(996Kb)  |  收藏  |  浏览/下载:50/15  |  提交时间:2024/06/05
Large sequence models for sequential decision-making: a survey 期刊论文
FRONTIERS OF COMPUTER SCIENCE, 2023, 卷号: 17, 期号: 6, 页码: 18
作者:  Wen, Muning;  Lin, Runji;  Wang, Hanjing;  Yang, Yaodong;  Wen, Ying;  Mai, Luo;  Wang, Jun;  Zhang, Haifeng;  Zhang, Weinan
Adobe PDF(1351Kb)  |  收藏  |  浏览/下载:164/9  |  提交时间:2023/11/17
sequential decision-making  sequence modeling  the Transformer  training system  
基于噪声对比估计的权重自适应对抗生成式模仿学习 期刊论文
模式识别与人工智能, 2023, 卷号: 36, 期号: 4, 页码: 300-312
作者:  关伟凡;  张希
Adobe PDF(1849Kb)  |  收藏  |  浏览/下载:150/50  |  提交时间:2023/06/29
强化学习  模仿学习  噪声对比估计  自适应权重  
Robot Subgoal-guided Navigation in Dynamic Crowded Environments with Hierarchical Deep Reinforcement Learning 期刊论文
International Journal of Control, Automation and Systems, 2023, 页码: 1-13
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强);  Liang YY(梁延研);  Zhang D(张渡)
Adobe PDF(6472Kb)  |  收藏  |  浏览/下载:192/42  |  提交时间:2023/06/12
Automatic Curriculum Learning for Large-Scale Cooperative Multiagent Systems 期刊论文
IEEE Transactions on Emerging Topics in Computational Intelligence, 2023, 卷号: 7, 期号: 3, 页码: 912-930
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(4728Kb)  |  收藏  |  浏览/下载:407/96  |  提交时间:2023/06/02
Multiexperience-Assisted Efficient Multiagent Reinforcement Learning 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2023, 页码: 1-15
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Yi JQ(易建强);  Wu SG(吴士广);  Pu ZQ(蒲志强);  Zhao YJ(赵彦杰)
Adobe PDF(2718Kb)  |  收藏  |  浏览/下载:325/108  |  提交时间:2023/06/02