CASIA OpenIR

Browse/Search Results:  1-7 of 7 Help

Selected(0)Clear Items/Page:    Sort:
A Gradient-Based Reinforcement Learning Algorithm for Multiple Cooperative Agents 期刊论文
IEEE ACCESS, 2018, 卷号: 6, 页码: 70223-70235
Authors:  Zhang, Zhen;  Wang, Dongqing;  Zhao, Dongbin;  Han, Qiaoni;  Song, Tingting
Favorite  |  View/Download:13/0  |  Submit date:2019/07/12
Multi-agent reinforcement learning  gradient ascent  Q-learning  cooperative tasks  
FMRQ-A Multiagent Reinforcement Learning Algorithm for Fully Cooperative Tasks 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 卷号: 47, 期号: 6, 页码: 1367-1379
Authors:  Zhang, Zhen;  Zhao, Dongbin;  Gao, Junwei;  Wang, Dongqing;  Dai, Yujie
Favorite  |  View/Download:98/0  |  Submit date:2017/07/18
Multiagent Reinforcement Learning (Marl)  Nash Equilibrium  Q-learning  Repeated Game  
Clique-based cooperative multiagent reinforcement learning using factor graphs 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2015, 卷号: 3, 期号: 1, 页码: 248-256
Authors:  Zhang,Zhen;  Zhao DB(赵冬斌)
View  |  Adobe PDF(707Kb)  |  Favorite  |  View/Download:37/16  |  Submit date:2017/12/30
Reinforcement Learning  Factor Graphs  
Computational Intelligence in Urban Traffic Signal Control: A Survey 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 卷号: 42, 期号: 4, 页码: 485-494
Authors:  Zhao, Dongbin;  Dai, Yujie;  Zhang, Zhen
Favorite  |  View/Download:36/0  |  Submit date:2015/08/12
Computational Intelligence (Ci)  Freeway Network  Surface-way Network  Traffic Congestions  Traffic Signal Control (Tsc)  
Self-teaching adaptive dynamic programming for Gomoku 期刊论文
NEUROCOMPUTING, 2012, 卷号: 78, 期号: 1, 页码: 23-29
Authors:  Zhao, Dongbin;  Zhang, Zhen;  Dai, Yujie
Favorite  |  View/Download:67/0  |  Submit date:2015/08/12
Gomoku  Reinforcement Learning  Adaptive Dynamic Programming  Temporal Difference Learning  Neural Network  
基于强化学习的城市交通信号优化控制 学位论文
, 中国科学院自动化研究所: 中国科学院大学, 2010
Authors:  张震
Adobe PDF(12129Kb)  |  Favorite  |  View/Download:127/0  |  Submit date:2015/09/02
强化学习  交通信号控制  多agent系统  基于基团分解  因子图  一般最大和算法  Reinforcement Learning  Traffic Signal Control  Multiagent Systems  Clique-based Decomposition  Factor Graphs  The General Max-plus Algorithm  
多角度人脸检测的统计学习研究 学位论文
, 中国科学院自动化研究所: 中国科学院研究生院, 2002
Authors:  张震球
Favorite  |  View/Download:31/0  |  Submit date:2015/09/02
人脸检测