CASIA OpenIR

浏览/检索结果: 共9条,第1-9条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning 会议论文
, 线上, 2022-02-22
作者:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li JQ(李金秋);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF(2593Kb)  |  收藏  |  浏览/下载:159/60  |  提交时间:2023/06/29
Hierarchical Cooperative Swarm Policy Learning with Role Emergence 会议论文
, Online, 05-07 December 2021
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Qiu TH(丘腾海);  Yi JQ(易建强)
Adobe PDF(327Kb)  |  收藏  |  浏览/下载:134/58  |  提交时间:2023/06/12
Multi-agent Collaborative Learning with Relational Graph Reasoning in Adversarial Environments 会议论文
, 线上会议, 2021-9
作者:  Wu Shiguang;  Qiu Tenghai;  Pu Zhiqiang;  Yi Jianqiang
Adobe PDF(1396Kb)  |  收藏  |  浏览/下载:249/74  |  提交时间:2022/06/16
Multi-target Coverage with Connectivity Maintenance using Knowledge-incorporated Policy Framework 会议论文
, Xi'an China, May 31-Jun. 4
作者:  Shiguang Wu;  Zhiqiang Pu;  Zhen Liu;  Tenghai Qiu;  Jianqiang Yi;  Tianle Zhang
Adobe PDF(12862Kb)  |  收藏  |  浏览/下载:269/42  |  提交时间:2022/04/06
Formation control with collision avoidance through deep reinforcement learning using model-guided demonstration 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2021, 卷号: 32, 期号: 6, 页码: 2358-2372
作者:  Zezhi Sui;  Zhiqiang Pu;  Jianqiang Yi;  Shiguang Wu
Adobe PDF(5344Kb)  |  收藏  |  浏览/下载:240/78  |  提交时间:2022/04/02
Collision avoidance  deep reinforcement learning (DRL)  formation control  leader–follower  
Distributed Dynamic Event-Triggered Control for Euler-Lagrange Multiagent Systems With Parametric Uncertainties 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2021, 页码: 13
作者:  Cao, Ran;  Cheng, Long
Adobe PDF(1843Kb)  |  收藏  |  浏览/下载:272/38  |  提交时间:2022/01/27
Vehicle dynamics  Nonlinear dynamical systems  Symmetric matrices  Laplace equations  Heuristic algorithms  Directed graphs  Robot sensing systems  Consensus  containment  dynamic event-triggered control  Euler-Lagrange system  multiagent systems (MASs)  
Fixed-time adaptive observer-based time-varying formation control for multi-agent systems with directed topologies 期刊论文
NEUROCOMPUTING, 2021, 卷号: 463, 页码: 483-494
作者:  Xiong, Tianyi;  Gu, Zhou;  Yi, Jianqiang;  Pu, Zhiqiang
收藏  |  浏览/下载:190/0  |  提交时间:2021/11/04
Directed topologies  Formation control  Fixed-time observer  Multi-agent systems  
Distributed Nash equilibrium seeking for integrated game and control of multi-agent systems with input delay 期刊论文
NONLINEAR DYNAMICS, 2021, 卷号: 106, 页码: 583-603
作者:  Ai, Xiaolin
Adobe PDF(3382Kb)  |  收藏  |  浏览/下载:193/36  |  提交时间:2021/11/03
Nash equilibrium seeking  Integrated game and control  Multi-agent systems  Input delay  Input-to-stable stability  
基于深度强化学习的群体协同决策关键问题研究 学位论文
, 中国科学院大学: 中国科学院大学人工智能学院, 2021
作者:  王彗木
Adobe PDF(8945Kb)  |  收藏  |  浏览/下载:292/1  |  提交时间:2021/06/24
群体系统  协同决策  多智能体系统  深度强化学习  图卷积网络  注 意力机制