CASIA OpenIR

浏览/检索结果: 共14条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning 会议论文
, 线上, 2022-02-22
作者:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li JQ(李金秋);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF(2593Kb)  |  收藏  |  浏览/下载:126/52  |  提交时间:2023/06/29
Learning to Play Hard Exploration Games Using Graph-guided Self-navigation 会议论文
, 线上, 2021-02
作者:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li K(李凯);  Li LJ(李丽娟);  Xing JL(兴军亮)
Adobe PDF(413Kb)  |  收藏  |  浏览/下载:136/52  |  提交时间:2023/06/28
Hierarchical Cooperative Swarm Policy Learning with Role Emergence 会议论文
, Online, 05-07 December 2021
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Qiu TH(丘腾海);  Yi JQ(易建强)
Adobe PDF(327Kb)  |  收藏  |  浏览/下载:121/56  |  提交时间:2023/06/12
Semantic Perception Swarm Policy with Deep Reinforcement Learning 会议论文
, Online, 05 December 2021
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(523Kb)  |  收藏  |  浏览/下载:97/40  |  提交时间:2023/06/12
Multi-agent Collaborative Learning with Relational Graph Reasoning in Adversarial Environments 会议论文
, 线上会议, 2021-9
作者:  Wu Shiguang;  Qiu Tenghai;  Pu Zhiqiang;  Yi Jianqiang
Adobe PDF(1396Kb)  |  收藏  |  浏览/下载:229/67  |  提交时间:2022/06/16
Multi-target Coverage with Connectivity Maintenance using Knowledge-incorporated Policy Framework 会议论文
, Xi'an China, May 31-Jun. 4
作者:  Shiguang Wu;  Zhiqiang Pu;  Zhen Liu;  Tenghai Qiu;  Jianqiang Yi;  Tianle Zhang
Adobe PDF(12862Kb)  |  收藏  |  浏览/下载:253/37  |  提交时间:2022/04/06
Neuro-Optimal Trajectory Tracking With Value Iteration of Discrete-Time Nonlinear Dynamics 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 12
作者:  Wang, Ding;  Ha, Mingming;  Cheng, Long
收藏  |  浏览/下载:252/0  |  提交时间:2022/01/27
Trajectory  Heuristic algorithms  Convergence  Trajectory tracking  Stability criteria  Optimal control  Dynamic programming  Adaptive critic design  discrete-time nonlinear plants  neuro-optimal trajectory tracking  uniformly ultimately bounded stability  value iteration  
Target Tracking Control of a Biomimetic Underwater Vehicle Through Deep Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 12
作者:  Wang, Yu;  Tang, Chong;  Wang, Shuo;  Cheng, Long;  Wang, Rui;  Tan, Min;  Hou, Zengguang
收藏  |  浏览/下载:212/0  |  提交时间:2022/01/27
Reinforcement learning  Target tracking  Robots  Sports  Aerospace electronics  Mobile robots  Underwater vehicles  Biomimetic underwater vehicle (BUV)  reinforcement learning  target tracking control  
仿生滑翔机器鲸鲨的运动控制与自主对接充电研究 学位论文
, 北京: 中国科学院大学, 2021
作者:  董会杰
Adobe PDF(7686Kb)  |  收藏  |  浏览/下载:280/15  |  提交时间:2021/12/31
仿生滑翔机器鲸鲨  滑翔效率优化  滑翔运动控制  自主对接充电  
基于深度强化学习的群体协同决策关键问题研究 学位论文
, 中国科学院大学: 中国科学院大学人工智能学院, 2021
作者:  王彗木
Adobe PDF(8945Kb)  |  收藏  |  浏览/下载:285/1  |  提交时间:2021/06/24
群体系统  协同决策  多智能体系统  深度强化学习  图卷积网络  注 意力机制