CASIA OpenIR

浏览/检索结果: 共6条,第1-6条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
A c-MET-Targeted Topical Fluorescent Probe cMBP-ICG Improves Oral Squamous Cell Carcinoma Detection in Humans 期刊论文
ANNALS OF SURGICAL ONCOLOGY, 2022, 页码: 11
作者:  Wang, Jingbo;  Li, Siyi;  Wang, Kun;  Zhu, Ling;  Yang, Lin;  Zhu, Yunjing;  Zhang, Zhen;  Hu, Longwei;  Yuan, Yuan;  Fan, Qi;  Ren, Jiliang;  Yang, Gongxin;  Ding, Weilong;  Zhou, Xiaoyu;  Cui, Junqi;  Zhang, Chunye;  Yuan, Ying;  Huang, Ruimin;  Tian, Jie;  Tao, Xiaofeng
收藏  |  浏览/下载:148/0  |  提交时间:2022/11/14
A Gradient-Based Reinforcement Learning Algorithm for Multiple Cooperative Agents 期刊论文
IEEE ACCESS, 2018, 卷号: 6, 页码: 70223-70235
作者:  Zhang, Zhen;  Wang, Dongqing;  Zhao, Dongbin;  Han, Qiaoni;  Song, Tingting
收藏  |  浏览/下载:226/0  |  提交时间:2019/07/12
Multi-agent reinforcement learning  gradient ascent  Q-learning  cooperative tasks  
FMRQ-A Multiagent Reinforcement Learning Algorithm for Fully Cooperative Tasks 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 卷号: 47, 期号: 6, 页码: 1367-1379
作者:  Zhang, Zhen;  Zhao, Dongbin;  Gao, Junwei;  Wang, Dongqing;  Dai, Yujie
收藏  |  浏览/下载:280/0  |  提交时间:2017/07/18
Multiagent Reinforcement Learning (Marl)  Nash Equilibrium  Q-learning  Repeated Game  
Clique-based cooperative multiagent reinforcement learning using factor graphs 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2015, 卷号: 3, 期号: 1, 页码: 248-256
作者:  Zhang,Zhen;  Zhao DB(赵冬斌)
浏览  |  Adobe PDF(707Kb)  |  收藏  |  浏览/下载:195/82  |  提交时间:2017/12/30
Reinforcement Learning  Factor Graphs  
Computational Intelligence in Urban Traffic Signal Control: A Survey 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 卷号: 42, 期号: 4, 页码: 485-494
作者:  Zhao, Dongbin;  Dai, Yujie;  Zhang, Zhen
收藏  |  浏览/下载:111/0  |  提交时间:2015/08/12
Computational Intelligence (Ci)  Freeway Network  Surface-way Network  Traffic Congestions  Traffic Signal Control (Tsc)  
Self-teaching adaptive dynamic programming for Gomoku 期刊论文
NEUROCOMPUTING, 2012, 卷号: 78, 期号: 1, 页码: 23-29
作者:  Zhao, Dongbin;  Zhang, Zhen;  Dai, Yujie
收藏  |  浏览/下载:187/0  |  提交时间:2015/08/12
Gomoku  Reinforcement Learning  Adaptive Dynamic Programming  Temporal Difference Learning  Neural Network