CASIA OpenIR

浏览/检索结果: 共18条,第1-10条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
Joint caching and transmission in the mobile edge network: An multi-agent learning approach 会议论文
, Madrid, Spain, 2021-12-7
作者:  Mi,Qirui;  Yang,Ning;  Zhang,Haifeng;  Zhang,Haijun;  Wang,Jun
Adobe PDF(1724Kb)  |  收藏  |  浏览/下载:19/6  |  提交时间:2024/06/05
MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Shenzhen, China, 18-22 July 2021
作者:  Zhiwei Xu;  Dapeng Li;  Yunpeng Bai;  Guoliang Fan
Adobe PDF(3892Kb)  |  收藏  |  浏览/下载:8/3  |  提交时间:2024/05/28
PEAN: 3D Hand Pose Estimation Adversarial Network 会议论文
, Milan, Italy, 2021-1
作者:  Linhui Sun;  Yifan Zhang;  Jian Cheng;  Hanqing Lu
Adobe PDF(1613Kb)  |  收藏  |  浏览/下载:85/21  |  提交时间:2024/01/22
Omnidirectional Drift Control of an Underwater Biomimetic Vehicle-Manipulator System via Reinforcement Learning 会议论文
, Suzhou, China, May 14-16, 2021
作者:  Ma, Ruichen;  Wang, Yu;  Wang, Rui;  Wang, Shuo
Adobe PDF(855Kb)  |  收藏  |  浏览/下载:94/34  |  提交时间:2023/08/02
Omnidirectional Drift Control  Undulating Fin  Underwater Biomimetic Vehicle-manipulator System (UBVMS)  Reinforcement Learning  Twin Delayed Deep Deterministic policy gradient (TD3)  
AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning 会议论文
, 线上, 2022-02-22
作者:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li JQ(李金秋);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF(2593Kb)  |  收藏  |  浏览/下载:172/66  |  提交时间:2023/06/29
Learning to Play Hard Exploration Games Using Graph-guided Self-navigation 会议论文
, 线上, 2021-02
作者:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li K(李凯);  Li LJ(李丽娟);  Xing JL(兴军亮)
Adobe PDF(413Kb)  |  收藏  |  浏览/下载:153/56  |  提交时间:2023/06/28
Locomotion Control of a Hybrid Propulsion Biomimetic Underwater Vehicle via Deep Reinforcement Learning 会议论文
, Xining, China, 15-19 July 2021
作者:  Zhang Tiandong;  Wang Rui;  Wang Yu;  Wang Shuo
Adobe PDF(1244Kb)  |  收藏  |  浏览/下载:66/23  |  提交时间:2023/06/14
Hierarchical Cooperative Swarm Policy Learning with Role Emergence 会议论文
, Online, 05-07 December 2021
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Qiu TH(丘腾海);  Yi JQ(易建强)
Adobe PDF(327Kb)  |  收藏  |  浏览/下载:139/60  |  提交时间:2023/06/12
Semantic Perception Swarm Policy with Deep Reinforcement Learning 会议论文
, Online, 05 December 2021
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(523Kb)  |  收藏  |  浏览/下载:119/48  |  提交时间:2023/06/12
Wide-Sense Stationary Policy Optimization with Bellman Residual on Video Games 会议论文
, Shenzhen, China, 05-09 July 2021
作者:  Gong C(龚晨);  He Q(何强);  Bai YP(白云鹏);  Hou XW(侯新文);  Fan GL(范国梁);  Liu Y(刘禹)
Adobe PDF(2780Kb)  |  收藏  |  浏览/下载:234/41  |  提交时间:2022/06/27
Video Game  Reinforcement Learning  Quantile Regression  Bellman residual  Wasserstein Distance