已选(0)清除
条数/页: 排序方式: |
| AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning 会议论文 , 线上, 2022-02-22 作者: Zhao EM(赵恩民); Yan RY(闫仁业); Li JQ(李金秋); Li K(李凯); Xing JL(兴军亮) Adobe PDF(2593Kb)  |  收藏  |  浏览/下载:126/52  |  提交时间:2023/06/29 |
| Learning to Play Hard Exploration Games Using Graph-guided Self-navigation 会议论文 , 线上, 2021-02 作者: Zhao EM(赵恩民); Yan RY(闫仁业); Li K(李凯); Li LJ(李丽娟); Xing JL(兴军亮) Adobe PDF(413Kb)  |  收藏  |  浏览/下载:136/52  |  提交时间:2023/06/28 |
| Hierarchical Cooperative Swarm Policy Learning with Role Emergence 会议论文 , Online, 05-07 December 2021 作者: Zhang TL(张天乐); Liu Z(刘振); Pu ZQ(蒲志强); Qiu TH(丘腾海); Yi JQ(易建强) Adobe PDF(327Kb)  |  收藏  |  浏览/下载:121/56  |  提交时间:2023/06/12 |
| Semantic Perception Swarm Policy with Deep Reinforcement Learning 会议论文 , Online, 05 December 2021 作者: Zhang TL(张天乐); Liu Z(刘振); Pu ZQ(蒲志强); Yi JQ(易建强) Adobe PDF(523Kb)  |  收藏  |  浏览/下载:97/40  |  提交时间:2023/06/12 |
| Multi-agent Collaborative Learning with Relational Graph Reasoning in Adversarial Environments 会议论文 , 线上会议, 2021-9 作者: Wu Shiguang; Qiu Tenghai; Pu Zhiqiang; Yi Jianqiang Adobe PDF(1396Kb)  |  收藏  |  浏览/下载:229/67  |  提交时间:2022/06/16 |
| Multi-target Coverage with Connectivity Maintenance using Knowledge-incorporated Policy Framework 会议论文 , Xi'an China, May 31-Jun. 4 作者: Shiguang Wu; Zhiqiang Pu; Zhen Liu; Tenghai Qiu; Jianqiang Yi; Tianle Zhang Adobe PDF(12862Kb)  |  收藏  |  浏览/下载:253/37  |  提交时间:2022/04/06 |
| Neuro-Optimal Trajectory Tracking With Value Iteration of Discrete-Time Nonlinear Dynamics 期刊论文 IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 12 作者: Wang, Ding; Ha, Mingming; Cheng, Long 收藏  |  浏览/下载:252/0  |  提交时间:2022/01/27 Trajectory Heuristic algorithms Convergence Trajectory tracking Stability criteria Optimal control Dynamic programming Adaptive critic design discrete-time nonlinear plants neuro-optimal trajectory tracking uniformly ultimately bounded stability value iteration |
| Target Tracking Control of a Biomimetic Underwater Vehicle Through Deep Reinforcement Learning 期刊论文 IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 12 作者: Wang, Yu; Tang, Chong; Wang, Shuo; Cheng, Long; Wang, Rui; Tan, Min; Hou, Zengguang 收藏  |  浏览/下载:212/0  |  提交时间:2022/01/27 Reinforcement learning Target tracking Robots Sports Aerospace electronics Mobile robots Underwater vehicles Biomimetic underwater vehicle (BUV) reinforcement learning target tracking control |
| 仿生滑翔机器鲸鲨的运动控制与自主对接充电研究 学位论文 , 北京: 中国科学院大学, 2021 作者: 董会杰 Adobe PDF(7686Kb)  |  收藏  |  浏览/下载:280/15  |  提交时间:2021/12/31 仿生滑翔机器鲸鲨 滑翔效率优化 滑翔运动控制 自主对接充电 |
| 基于深度强化学习的群体协同决策关键问题研究 学位论文 , 中国科学院大学: 中国科学院大学人工智能学院, 2021 作者: 王彗木 Adobe PDF(8945Kb)  |  收藏  |  浏览/下载:285/1  |  提交时间:2021/06/24 群体系统 协同决策 多智能体系统 深度强化学习 图卷积网络 注 意力机制 |