CASIA OpenIR

浏览/检索结果: 共47条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game 会议论文
, 线上, 2020-5
作者:  Liu MS(刘民颂);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(727Kb)  |  收藏  |  浏览/下载:25/11  |  提交时间:2024/07/04
Gait Learning for 3D Bipedal Robots Based on a Combined Strategy of Hybrid Zero Dynamics Feedback Control and Periodic Reward 会议论文
, 中国湖南长沙, 2024-5-25
作者:  Cui LZ(崔凌志);  Tianqi Deng;  Lihua Ma;  Wenhao He
Adobe PDF(690Kb)  |  收藏  |  浏览/下载:37/15  |  提交时间:2024/07/01
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:42/16  |  提交时间:2024/06/25
强化学习,分层强化学习  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:23/11  |  提交时间:2024/06/25
Minimizing Age of Information for Mobile Edge Computing Systems: A Nested Index Approach 会议论文
, Singapore, 2023/8/24-27
作者:  Chen,Shuo;  Yang,Ning;  Zhang,Meng;  Wang,Jun
Adobe PDF(1413Kb)  |  收藏  |  浏览/下载:53/11  |  提交时间:2024/06/05
MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Shenzhen, China, 18-22 July 2021
作者:  Zhiwei Xu;  Dapeng Li;  Yunpeng Bai;  Guoliang Fan
Adobe PDF(3892Kb)  |  收藏  |  浏览/下载:23/12  |  提交时间:2024/05/28
Second-Order Global Attention Networks for Graph Classification and Regression 会议论文
, Beijing, China, August 27-28, 2022
作者:  Hu Fenyu;  Cui Zeyu;  Wu Shu;  Liu Qiang;  Wu Jinlin;  Wang Liang;  Tan Tieniu
Adobe PDF(69424Kb)  |  收藏  |  浏览/下载:229/74  |  提交时间:2023/07/06
Multi-Objective Bayesian Optimization using Deep Gaussian Processes with Applications to Copper Smelting Optimization 会议论文
, 新加坡, 2022-12
作者:  Kang, Liwen;  Wang, Xuelei;  Wu, Zhiheng;  Wang, Ruihua
Adobe PDF(607Kb)  |  收藏  |  浏览/下载:153/41  |  提交时间:2023/06/29
Consensus Control of Multi-Agent Systems With Two-Way Switching Directed Topology 会议论文
, 北京, 2020-12-5
作者:  Wang Xin;  Wei Qinglai;  Song Ruizhuo
Adobe PDF(898Kb)  |  收藏  |  浏览/下载:108/43  |  提交时间:2023/06/28
Stable Training of Bellman Error in Reinforcement Learning 会议论文
, Thailand, November 18–22
作者:  Gong C(龚晨);  Bai YP(白云鹏);  Hou XW(侯新文);  Ji XH(季晓慧)
Adobe PDF(2416Kb)  |  收藏  |  浏览/下载:134/38  |  提交时间:2023/06/27