| Joint caching and transmission in the mobile edge network: An multi-agent learning approach. Conference paper, Madrid, Spain, 2021-12-7. Authors: Mi, Qirui; Yang, Ning; Zhang, Haifeng; Zhang, Haijun; Wang, Jun
Adobe PDF(1724Kb)  |  Views/Downloads: 21/7  |  Submitted: 2024/06/05 |
| MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning. Conference paper, Shenzhen, China, 18-22 July 2021. Authors: Zhiwei Xu; Dapeng Li; Yunpeng Bai; Guoliang Fan
Adobe PDF(3892Kb)  |  Views/Downloads: 8/3  |  Submitted: 2024/05/28 |
| Omnidirectional Drift Control of an Underwater Biomimetic Vehicle-Manipulator System via Reinforcement Learning. Conference paper, Suzhou, China, May 14-16, 2021. Authors: Ma, Ruichen; Wang, Yu; Wang, Rui; Wang, Shuo
Adobe PDF(855Kb)  |  Views/Downloads: 97/34  |  Submitted: 2023/08/02  |  Keywords: Omnidirectional Drift Control; Undulating Fin; Underwater Biomimetic Vehicle-manipulator System (UBVMS); Reinforcement Learning; Twin Delayed Deep Deterministic policy gradient (TD3) |
| AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning. Conference paper, Online, 2022-02-22. Authors: Zhao EM(赵恩民); Yan RY(闫仁业); Li JQ(李金秋); Li K(李凯); Xing JL(兴军亮)
Adobe PDF(2593Kb)  |  Views/Downloads: 175/68  |  Submitted: 2023/06/29 |
| Learning to Play Hard Exploration Games Using Graph-guided Self-navigation. Conference paper, Online, 2021-02. Authors: Zhao EM(赵恩民); Yan RY(闫仁业); Li K(李凯); Li LJ(李丽娟); Xing JL(兴军亮)
Adobe PDF(413Kb)  |  Views/Downloads: 157/56  |  Submitted: 2023/06/28 |
| Locomotion Control of a Hybrid Propulsion Biomimetic Underwater Vehicle via Deep Reinforcement Learning. Conference paper, Xining, China, 15-19 July 2021. Authors: Zhang Tiandong; Wang Rui; Wang Yu; Wang Shuo
Adobe PDF(1244Kb)  |  Views/Downloads: 69/23  |  Submitted: 2023/06/14 |
| Hierarchical Cooperative Swarm Policy Learning with Role Emergence. Conference paper, Online, 05-07 December 2021. Authors: Zhang TL(张天乐); Liu Z(刘振); Pu ZQ(蒲志强); Qiu TH(丘腾海); Yi JQ(易建强)
Adobe PDF(327Kb)  |  Views/Downloads: 142/60  |  Submitted: 2023/06/12 |
| Semantic Perception Swarm Policy with Deep Reinforcement Learning. Conference paper, Online, 05 December 2021. Authors: Zhang TL(张天乐); Liu Z(刘振); Pu ZQ(蒲志强); Yi JQ(易建强)
Adobe PDF(523Kb)  |  Views/Downloads: 123/50  |  Submitted: 2023/06/12 |
| Benchmarking lane-changing decision-making for deep reinforcement learning. Conference paper, Guangzhou, China, 2021-11. Authors: Wang JJ(王俊杰); Zhang QC(张启超); Zhao DB(赵冬斌)
Adobe PDF(1117Kb)  |  Views/Downloads: 136/47  |  Submitted: 2023/05/30 |
| Wide-Sense Stationary Policy Optimization with Bellman Residual on Video Games. Conference paper, Shenzhen, China, 05-09 July 2021. Authors: Gong C(龚晨); He Q(何强); Bai YP(白云鹏); Hou XW(侯新文); Fan GL(范国梁); Liu Y(刘禹)
Adobe PDF(2780Kb)  |  Views/Downloads: 237/42  |  Submitted: 2022/06/27  |  Keywords: Video Game; Reinforcement Learning; Quantile Regression; Bellman residual; Wasserstein Distance |