已选(0)清除
条数/页: 排序方式: |
| Joint caching and transmission in the mobile edge network: An multi-agent learning approach 会议论文 , Madrid, Spain, 2021-12-7 作者: Mi,Qirui; Yang,Ning ; Zhang,Haifeng; Zhang,Haijun; Wang,Jun
Adobe PDF(1724Kb)  |   收藏  |  浏览/下载:44/11  |  提交时间:2024/06/05 |
| MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning 会议论文 , Shenzhen, China, 18-22 July 2021 作者: Zhiwei Xu ; Dapeng Li ; Yunpeng Bai ; Guoliang Fan![](/image/person.jpg)
Adobe PDF(3892Kb)  |   收藏  |  浏览/下载:23/12  |  提交时间:2024/05/28 |
| Omnidirectional Drift Control of an Underwater Biomimetic Vehicle-Manipulator System via Reinforcement Learning 会议论文 , Suzhou, China, May 14-16, 2021 作者: Ma, Ruichen ; Wang, Yu ; Wang, Rui ; Wang, Shuo![](/image/person.jpg)
Adobe PDF(855Kb)  |   收藏  |  浏览/下载:123/43  |  提交时间:2023/08/02 Omnidirectional Drift Control Undulating Fin Underwater Biomimetic Vehicle-manipulator System (UBVMS) Reinforcement Learning Twin Delayed Deep Deterministic policy gradient (TD3) |
| AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning 会议论文 , 线上, 2022-02-22 作者: Zhao EM(赵恩民) ; Yan RY(闫仁业); Li JQ(李金秋); Li K(李凯) ; Xing JL(兴军亮)![](/image/person.jpg)
Adobe PDF(2593Kb)  |   收藏  |  浏览/下载:214/79  |  提交时间:2023/06/29 |
| Learning to Play Hard Exploration Games Using Graph-guided Self-navigation 会议论文 , 线上, 2021-02 作者: Zhao EM(赵恩民) ; Yan RY(闫仁业); Li K(李凯) ; Li LJ(李丽娟) ; Xing JL(兴军亮)![](/image/person.jpg)
Adobe PDF(413Kb)  |   收藏  |  浏览/下载:186/66  |  提交时间:2023/06/28 |
| Locomotion Control of a Hybrid Propulsion Biomimetic Underwater Vehicle via Deep Reinforcement Learning 会议论文 , Xining, China, 15-19 July 2021 作者: Zhang Tiandong ; Wang Rui ; Wang Yu ; Wang Shuo![](/image/person.jpg)
Adobe PDF(1244Kb)  |   收藏  |  浏览/下载:91/32  |  提交时间:2023/06/14 |
| Hierarchical Cooperative Swarm Policy Learning with Role Emergence 会议论文 , Online, 05-07 December 2021 作者: Zhang TL(张天乐) ; Liu Z(刘振) ; Pu ZQ(蒲志强) ; Qiu TH(丘腾海) ; Yi JQ(易建强)![](/image/person.jpg)
Adobe PDF(327Kb)  |   收藏  |  浏览/下载:162/69  |  提交时间:2023/06/12 |
| Semantic Perception Swarm Policy with Deep Reinforcement Learning 会议论文 , Online, 05 December 2021 作者: Zhang TL(张天乐) ; Liu Z(刘振) ; Pu ZQ(蒲志强) ; Yi JQ(易建强)![](/image/person.jpg)
Adobe PDF(523Kb)  |   收藏  |  浏览/下载:139/55  |  提交时间:2023/06/12 |
| Wide-Sense Stationary Policy Optimization with Bellman Residual on Video Games 会议论文 , Shenzhen, China, 05-09 July 2021 作者: Gong C(龚晨) ; He Q(何强) ; Bai YP(白云鹏) ; Hou XW(侯新文) ; Fan GL(范国梁) ; Liu Y(刘禹)![](/image/person.jpg)
Adobe PDF(2780Kb)  |   收藏  |  浏览/下载:252/45  |  提交时间:2022/06/27 Video Game Reinforcement Learning Quantile Regression Bellman residual Wasserstein Distance |
| Multi-agent Collaborative Learning with Relational Graph Reasoning in Adversarial Environments 会议论文 , 线上会议, 2021-9 作者: Wu Shiguang ; Qiu Tenghai ; Pu Zhiqiang ; Yi Jianqiang![](/image/person.jpg)
Adobe PDF(1396Kb)  |   收藏  |  浏览/下载:275/80  |  提交时间:2022/06/16 |