| Joint caching and transmission in the mobile edge network: An multi-agent learning approach. Conference paper, Madrid, Spain, 2021-12-7. Authors: Mi, Qirui; Yang, Ning; Zhang, Haifeng; Zhang, Haijun; Wang, Jun. Adobe PDF (1724Kb)  |  Views/Downloads: 19/6  |  Submitted: 2024/06/05 |
| MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning. Conference paper, Shenzhen, China, 18-22 July 2021. Authors: Zhiwei Xu; Dapeng Li; Yunpeng Bai; Guoliang Fan. Adobe PDF (3892Kb)  |  Views/Downloads: 8/3  |  Submitted: 2024/05/28 |
| PEAN: 3D Hand Pose Estimation Adversarial Network. Conference paper, Milan, Italy, 2021-1. Authors: Linhui Sun; Yifan Zhang; Jian Cheng; Hanqing Lu. Adobe PDF (1613Kb)  |  Views/Downloads: 85/21  |  Submitted: 2024/01/22 |
| Omnidirectional Drift Control of an Underwater Biomimetic Vehicle-Manipulator System via Reinforcement Learning. Conference paper, Suzhou, China, May 14-16, 2021. Authors: Ma, Ruichen; Wang, Yu; Wang, Rui; Wang, Shuo. Adobe PDF (855Kb)  |  Views/Downloads: 94/34  |  Submitted: 2023/08/02  |  Keywords: Omnidirectional Drift Control; Undulating Fin; Underwater Biomimetic Vehicle-manipulator System (UBVMS); Reinforcement Learning; Twin Delayed Deep Deterministic policy gradient (TD3) |
| AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning. Conference paper, Online, 2022-02-22. Authors: Zhao EM(赵恩民); Yan RY(闫仁业); Li JQ(李金秋); Li K(李凯); Xing JL(兴军亮). Adobe PDF (2593Kb)  |  Views/Downloads: 172/66  |  Submitted: 2023/06/29 |
| Learning to Play Hard Exploration Games Using Graph-guided Self-navigation. Conference paper, Online, 2021-02. Authors: Zhao EM(赵恩民); Yan RY(闫仁业); Li K(李凯); Li LJ(李丽娟); Xing JL(兴军亮). Adobe PDF (413Kb)  |  Views/Downloads: 153/56  |  Submitted: 2023/06/28 |
| Locomotion Control of a Hybrid Propulsion Biomimetic Underwater Vehicle via Deep Reinforcement Learning. Conference paper, Xining, China, 15-19 July 2021. Authors: Zhang Tiandong; Wang Rui; Wang Yu; Wang Shuo. Adobe PDF (1244Kb)  |  Views/Downloads: 66/23  |  Submitted: 2023/06/14 |
| Hierarchical Cooperative Swarm Policy Learning with Role Emergence. Conference paper, Online, 05-07 December 2021. Authors: Zhang TL(张天乐); Liu Z(刘振); Pu ZQ(蒲志强); Qiu TH(丘腾海); Yi JQ(易建强). Adobe PDF (327Kb)  |  Views/Downloads: 139/60  |  Submitted: 2023/06/12 |
| Semantic Perception Swarm Policy with Deep Reinforcement Learning. Conference paper, Online, 05 December 2021. Authors: Zhang TL(张天乐); Liu Z(刘振); Pu ZQ(蒲志强); Yi JQ(易建强). Adobe PDF (523Kb)  |  Views/Downloads: 119/48  |  Submitted: 2023/06/12 |
| Wide-Sense Stationary Policy Optimization with Bellman Residual on Video Games. Conference paper, Shenzhen, China, 05-09 July 2021. Authors: Gong C(龚晨); He Q(何强); Bai YP(白云鹏); Hou XW(侯新文); Fan GL(范国梁); Liu Y(刘禹). Adobe PDF (2780Kb)  |  Views/Downloads: 234/41  |  Submitted: 2022/06/27  |  Keywords: Video Game; Reinforcement Learning; Quantile Regression; Bellman residual; Wasserstein Distance |