已选(0)清除
条数/页: 排序方式: |
| Joint caching and transmission in the mobile edge network: An multi-agent learning approach 会议论文 , Madrid, Spain, 2021-12-7 作者: Mi,Qirui; Yang,Ning; Zhang,Haifeng; Zhang,Haijun; Wang,Jun Adobe PDF(1724Kb)  |  收藏  |  浏览/下载:44/11  |  提交时间:2024/06/05 |
| MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning 会议论文 , Shenzhen, China, 18-22 July 2021 作者: Zhiwei Xu; Dapeng Li; Yunpeng Bai; Guoliang Fan Adobe PDF(3892Kb)  |  收藏  |  浏览/下载:23/12  |  提交时间:2024/05/28 |
| Omnidirectional Drift Control of an Underwater Biomimetic Vehicle-Manipulator System via Reinforcement Learning 会议论文 , Suzhou, China, May 14-16, 2021 作者: Ma, Ruichen; Wang, Yu; Wang, Rui; Wang, Shuo Adobe PDF(855Kb)  |  收藏  |  浏览/下载:123/43  |  提交时间:2023/08/02 Omnidirectional Drift Control Undulating Fin Underwater Biomimetic Vehicle-manipulator System (UBVMS) Reinforcement Learning Twin Delayed Deep Deterministic policy gradient (TD3) |
| AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning 会议论文 , 线上, 2022-02-22 作者: Zhao EM(赵恩民); Yan RY(闫仁业); Li JQ(李金秋); Li K(李凯); Xing JL(兴军亮) Adobe PDF(2593Kb)  |  收藏  |  浏览/下载:214/79  |  提交时间:2023/06/29 |
| Learning to Play Hard Exploration Games Using Graph-guided Self-navigation 会议论文 , 线上, 2021-02 作者: Zhao EM(赵恩民); Yan RY(闫仁业); Li K(李凯); Li LJ(李丽娟); Xing JL(兴军亮) Adobe PDF(413Kb)  |  收藏  |  浏览/下载:186/66  |  提交时间:2023/06/28 |
| Locomotion Control of a Hybrid Propulsion Biomimetic Underwater Vehicle via Deep Reinforcement Learning 会议论文 , Xining, China, 15-19 July 2021 作者: Zhang Tiandong; Wang Rui; Wang Yu; Wang Shuo Adobe PDF(1244Kb)  |  收藏  |  浏览/下载:91/32  |  提交时间:2023/06/14 |
| Hierarchical Cooperative Swarm Policy Learning with Role Emergence 会议论文 , Online, 05-07 December 2021 作者: Zhang TL(张天乐); Liu Z(刘振); Pu ZQ(蒲志强); Qiu TH(丘腾海); Yi JQ(易建强) Adobe PDF(327Kb)  |  收藏  |  浏览/下载:162/69  |  提交时间:2023/06/12 |
| Semantic Perception Swarm Policy with Deep Reinforcement Learning 会议论文 , Online, 05 December 2021 作者: Zhang TL(张天乐); Liu Z(刘振); Pu ZQ(蒲志强); Yi JQ(易建强) Adobe PDF(523Kb)  |  收藏  |  浏览/下载:139/55  |  提交时间:2023/06/12 |
| Benchmarking lane-changing decision-making for deep reinforcement learning 会议论文 , Guangzhou, China, 2021-11 作者: Wang JJ(王俊杰); Zhang QC(张启超); Zhao DB(赵冬斌) Adobe PDF(1117Kb)  |  收藏  |  浏览/下载:162/55  |  提交时间:2023/05/30 |
| Wide-Sense Stationary Policy Optimization with Bellman Residual on Video Games 会议论文 , Shenzhen, China, 05-09 July 2021 作者: Gong C(龚晨); He Q(何强); Bai YP(白云鹏); Hou XW(侯新文); Fan GL(范国梁); Liu Y(刘禹) Adobe PDF(2780Kb)  |  收藏  |  浏览/下载:252/45  |  提交时间:2022/06/27 Video Game Reinforcement Learning Quantile Regression Bellman residual Wasserstein Distance |