CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共6条,第1-6条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Soft Contrastive Learning with Q-irrelevance Abstraction for Reinforcement Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2022, 页码: doi={10.1109/TCDS.2022.3218940}
作者:  Minsong Liu;  Luntong Li;  Shuai Hao;  Yuanheng Zhu;  Dongbin Zhao
Adobe PDF(12013Kb)  |  收藏  |  浏览/下载:68/18  |  提交时间:2023/04/26
Multi-Agent Reinforcement Learning Based on Clustering in Two-Player Games 会议论文
, Xiamen, China, 2019-12-6
作者:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(488Kb)  |  收藏  |  浏览/下载:105/35  |  提交时间:2023/06/28
reinforcement learning  unsupervised clustering  matrix game  
ADP with MCTS algorithm for Gomoku 会议论文
, Athens, Greece, 6-9 Dec. 2016
作者:  Tang Zhentao;  Zhao Dongbin;  Shao Kun;  Lv Le
浏览  |  Adobe PDF(866Kb)  |  收藏  |  浏览/下载:651/304  |  提交时间:2017/05/08
Clique-based cooperative multiagent reinforcement learning using factor graphs 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2015, 卷号: 3, 期号: 1, 页码: 248-256
作者:  Zhang,Zhen;  Zhao DB(赵冬斌)
Adobe PDF(707Kb)  |  收藏  |  浏览/下载:198/83  |  提交时间:2017/12/30
Reinforcement Learning  Factor Graphs  
Online reinforcement learning for continuous-state systems 专著章节/文集论文
出自: Frontiers of Intelligent Control and Information Processing, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore, Singapore:World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, World Scientific, 2014
作者:  Yuanheng Zhu;  Zhao DB(赵冬斌)
Adobe PDF(24150Kb)  |  收藏  |  浏览/下载:240/26  |  提交时间:2017/09/13
DHP Method for Ramp Metering of Freeway Traffic 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2011, 卷号: 12, 期号: 4, 页码: 990-999
作者:  Zhao, Dongbin;  Bai, Xuerui;  Wang, Fei-Yue;  Xu, Jing;  Yu, Wensheng;  Fei-Yue Wang
浏览  |  Adobe PDF(827Kb)  |  收藏  |  浏览/下载:236/73  |  提交时间:2015/08/12
Congestion  Dual Heuristic Programming (Dhp)  Ramp Metering  Traffic Control