CASIA OpenIR

浏览/检索结果: 共3条,第1-3条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
A Gradient-Based Reinforcement Learning Algorithm for Multiple Cooperative Agents 期刊论文
IEEE ACCESS, 2018, 卷号: 6, 页码: 70223-70235
作者:  Zhang, Zhen;  Wang, Dongqing;  Zhao, Dongbin;  Han, Qiaoni;  Song, Tingting
收藏  |  浏览/下载:260/0  |  提交时间:2019/07/12
Multi-agent reinforcement learning  gradient ascent  Q-learning  cooperative tasks  
Policy Iteration Algorithm Based Fault Tolerant Tracking Control: An Implementation on Reconfigurable Manipulators 期刊论文
JOURNAL OF ELECTRICAL ENGINEERING & TECHNOLOGY, 2018, 卷号: 13, 期号: 4, 页码: 1739-1750
作者:  Li, Yuanchun;  Xia, Hongbing;  Zhao, Bo
浏览  |  Adobe PDF(708Kb)  |  收藏  |  浏览/下载:358/59  |  提交时间:2018/10/10
Adaptive dynamic programming  Policy iteration  Fault tolerant tracking control  Reconfigurable manipulators  Neural network  
A Pareto optimal mechanism for demand-side platforms in real time bidding advertising markets 期刊论文
INFORMATION SCIENCES, 2018, 卷号: 469, 页码: 119-140
作者:  Qin, Rui;  Yuan, Yong;  Wang, Fei-Yue
浏览  |  Adobe PDF(1804Kb)  |  收藏  |  浏览/下载:512/161  |  提交时间:2018/09/20
Computational advertising  Real time bidding  Demand side platform  Pareto optimal  Mechanism design  Computational experiment