| An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game. Conference paper, online, 2020-5. Authors: Liu MS(刘民颂); Zhu YH(朱圆恒); Zhao DB(赵冬斌)
Adobe PDF (727Kb)  |  Views/Downloads: 25/11  |  Submitted: 2024/07/04 |
| Gait Learning for 3D Bipedal Robots Based on a Combined Strategy of Hybrid Zero Dynamics Feedback Control and Periodic Reward. Conference paper, Changsha, Hunan, China, 2024-5-25. Authors: Cui LZ(崔凌志); Tianqi Deng; Lihua Ma; Wenhao He
Adobe PDF (690Kb)  |  Views/Downloads: 37/15  |  Submitted: 2024/07/01 |
| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs. Conference paper, Australia, 2023-6. Authors: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo
Adobe PDF (7948Kb)  |  Views/Downloads: 42/16  |  Submitted: 2024/06/25  |  Keywords: reinforcement learning, hierarchical reinforcement learning |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning. Journal article, Machine Intelligence Research, 2023, page: 158. Authors: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu
Adobe PDF (9639Kb)  |  Views/Downloads: 23/11  |  Submitted: 2024/06/25 |
| Minimizing Age of Information for Mobile Edge Computing Systems: A Nested Index Approach. Conference paper, Singapore, 2023/8/24-27. Authors: Chen, Shuo; Yang, Ning; Zhang, Meng; Wang, Jun
Adobe PDF (1413Kb)  |  Views/Downloads: 53/11  |  Submitted: 2024/06/05 |
| MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning. Conference paper, Shenzhen, China, 18-22 July 2021. Authors: Zhiwei Xu; Dapeng Li; Yunpeng Bai; Guoliang Fan
Adobe PDF (3892Kb)  |  Views/Downloads: 23/12  |  Submitted: 2024/05/28 |
| Second-Order Global Attention Networks for Graph Classification and Regression. Conference paper, Beijing, China, August 27-28, 2022. Authors: Hu Fenyu; Cui Zeyu; Wu Shu; Liu Qiang; Wu Jinlin; Wang Liang; Tan Tieniu
Adobe PDF (69424Kb)  |  Views/Downloads: 229/74  |  Submitted: 2023/07/06 |
| Multi-Objective Bayesian Optimization using Deep Gaussian Processes with Applications to Copper Smelting Optimization. Conference paper, Singapore, 2022-12. Authors: Kang, Liwen; Wang, Xuelei; Wu, Zhiheng; Wang, Ruihua
Adobe PDF (607Kb)  |  Views/Downloads: 153/41  |  Submitted: 2023/06/29 |
| Consensus Control of Multi-Agent Systems With Two-Way Switching Directed Topology. Conference paper, Beijing, 2020-12-5. Authors: Wang Xin; Wei Qinglai; Song Ruizhuo
Adobe PDF (898Kb)  |  Views/Downloads: 108/43  |  Submitted: 2023/06/28 |
| Stable Training of Bellman Error in Reinforcement Learning. Conference paper, Thailand, November 18–22. Authors: Gong C(龚晨); Bai YP(白云鹏); Hou XW(侯新文); Ji XH(季晓慧)
Adobe PDF (2416Kb)  |  Views/Downloads: 134/38  |  Submitted: 2023/06/27 |