CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共15条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game 期刊论文
IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, 页码: 1-13
作者:  Guangzheng Hu;  Yuanheng Zhu;  Haoran Li;  Dongbin Zhao
Adobe PDF(2144Kb)  |  收藏  |  浏览/下载:39/7  |  提交时间:2024/06/05
Games  Q-learning  Task analysis  Optimization  Convergence  Training  Nash equilibrium  Multi-agent reinforcement learning  minimax-Q learning  two-team zero-sum Markov games  
Multi-task safe reinforcement learning for navigating intersections in dense traffic 期刊论文
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 卷号: 360, 期号: 17, 页码: 13737-13760
作者:  Liu, Yuqi;  Gao, Yinfeng;  Zhang, Qichao;  Ding, Dawei;  Zhao, Dongbin
Adobe PDF(3095Kb)  |  收藏  |  浏览/下载:83/16  |  提交时间:2024/02/22
Empirical Policy Optimization for n-Player Markov Games 期刊论文
IEEE Transactions on Cybernetics, 2022, 页码: doi={10.1109/TCYB.2022.3179775}
作者:  Yuanheng Zhu;  Weifan Li;  Mengchen Zhao;  Jianye Hao;  Dongbin Zhao
Adobe PDF(1739Kb)  |  收藏  |  浏览/下载:111/44  |  提交时间:2023/04/26
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 13
作者:  Hu, Guangzheng;  Zhu, Yuanheng;  Zhao, Dongbin;  Zhao, Mengchen;  Hao, Jianye
Adobe PDF(4187Kb)  |  收藏  |  浏览/下载:262/12  |  提交时间:2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)  
Heuristic rank selection with progressively searching tensor ring network 期刊论文
COMPLEX & INTELLIGENT SYSTEMS, 2021, 页码: 15
作者:  Li, Nannan;  Pan, Yu;  Chen, Yaran;  Ding, Zixiang;  Zhao, Dongbin;  Xu, Zenglin
Adobe PDF(1305Kb)  |  收藏  |  浏览/下载:313/57  |  提交时间:2021/04/27
Tensor ring networks  Rank selection  Progressively search  Image classification  
CNN-G: convolutional neural network combined with graph for image segmentation with theoretical analysis 期刊论文
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2021, 卷号: 0, 期号: 0, 页码: 0
作者:  Lu, Yi;  Chen, Yaran;  Zhao, Dongbin;  Liu, Bao;  Lai, Zhichao;  Chen, Jianxin
浏览  |  Adobe PDF(5636Kb)  |  收藏  |  浏览/下载:361/148  |  提交时间:2020/10/19
Graph neural network, image segmentation, self-attention, structure pattern learning.  
An Autonomous Driving Experience Platform with Learning-Based Functions 会议论文
, Bangalore, India, 18-21 Nov. 2018
作者:  Li, Dong;  Zhao, Dongbin;  Zhang, Qichao;  Zhu, Yuanheng
浏览  |  Adobe PDF(215Kb)  |  收藏  |  浏览/下载:305/77  |  提交时间:2019/04/25
Policy Iteration for H infinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 卷号: 48, 期号: 2, 页码: 500-509
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Yang, Xiong;  Zhang, Qichao
Adobe PDF(892Kb)  |  收藏  |  浏览/下载:326/51  |  提交时间:2018/10/10
Adaptive Dynamic Programming (Adp)  h Infinity Optimal Control  Policy Iteration (Pi)  Polynomial Nonlinear Systems  Sum Of Squares (Sos)  
Clique-based cooperative multiagent reinforcement learning using factor graphs 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2015, 卷号: 3, 期号: 1, 页码: 248-256
作者:  Zhang,Zhen;  Zhao DB(赵冬斌)
浏览  |  Adobe PDF(707Kb)  |  收藏  |  浏览/下载:235/96  |  提交时间:2017/12/30
Reinforcement Learning  Factor Graphs  
Event-Triggered Adaptive Dynamic Programming for Uncertain Nonlinear Systems 会议论文
, Beijing, China, November 19–23
作者:  Zhang,Qichao;  Zhao,Dongbin;  Wang,Ding
浏览  |  Adobe PDF(153Kb)  |  收藏  |  浏览/下载:214/88  |  提交时间:2017/12/28