CASIA OpenIR

浏览/检索结果: 共13条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game 会议论文
, 线上, 2020-5
作者:  Liu MS(刘民颂);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(727Kb)  |  收藏  |  浏览/下载:23/10  |  提交时间:2024/07/04
Stable Training of Bellman Error in Reinforcement Learning 会议论文
, Thailand, November 18–22
作者:  Gong C(龚晨);  Bai YP(白云鹏);  Hou XW(侯新文);  Ji XH(季晓慧)
Adobe PDF(2416Kb)  |  收藏  |  浏览/下载:132/37  |  提交时间:2023/06/27
Wd3: Taming the estimation bias in deep reinforcement learning 会议论文
, Baltimore, MD, USA, 2020-12
作者:  He Q(何强);  Hou XW(侯新文)
Adobe PDF(2006Kb)  |  收藏  |  浏览/下载:235/50  |  提交时间:2022/06/27
deep reinforcement learning  estimation bias  neural networks  
Cyber-Physical-Social Systems for Smart City: An Implementation Based on Intelligent Loop 会议论文
, 北京, 2020-12-5
作者:  Xiong, Gang;  Chen, Xiaoyu;  Shuo, Nan;  Lv, Yisheng;  Zhu, Fenghua;  Qu, Tianci;  Ye, Peijun
Adobe PDF(457Kb)  |  收藏  |  浏览/下载:213/62  |  提交时间:2022/06/16
Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 12, 页码: 5245-5256
作者:  Wei, Qinglai;  Wang, Lingxiao;  Liu, Yu;  Polycarpou, Marios M.
Adobe PDF(4019Kb)  |  收藏  |  浏览/下载:382/86  |  提交时间:2021/03/08
Elevators  Optimal control  Backpropagation  Machine learning  Neural networks  Learning (artificial intelligence)  Actor  –critic  adaptive dynamic programming  deep learning (DL)  elevator group control (EGC)  optimal control  reinforcement learning (RL)  
Neuro-optimal control for discrete stochastic processes via a novel policy iteration algorithm 期刊论文
IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2020, 卷号: 50, 期号: 11, 页码: 3972-3985
作者:  Liang, Mingming;  Wang, Ding;  Liu, Derong
浏览  |  Adobe PDF(1604Kb)  |  收藏  |  浏览/下载:223/74  |  提交时间:2020/10/23
Adaptive critic designs  adaptive dynamic programming (ADP)  local policy iteration  neuro-dynamic programming  optimal control  stochastic processes  
A Novel GSP Auction Mechanism for Dynamic Confirmation Games on Bitcoin Transactions 期刊论文
IEEE Transactions on Services Computing, 2020, 期号: NA, 页码: NA
作者:  Li, Juanjuan;  Ni, Xiaochun;  Yuan, Yong;  Wang, Fei-Yue
浏览  |  Adobe PDF(2436Kb)  |  收藏  |  浏览/下载:225/61  |  提交时间:2020/10/14
Blockchain  Transaction Fee  Bitcoin transaction confirmation game  Generalized Second Price mechanism  
Real-world Robot Reaching Skill Learning Based on Deep Reinforcement Learning 会议论文
, Hefei, China, 2020
作者:  Liu, Naijun;  Lu, Tao;  Cai, Yinghao;  Wang, Rui;  Wang, Shuo
浏览  |  Adobe PDF(436Kb)  |  收藏  |  浏览/下载:188/66  |  提交时间:2020/09/27
ACDER: Augmented Curiosity-Driven Experience Replay 会议论文
, Paris, France, 2020.05.31-2020.08.31
作者:  Li, Boyao;  Lu, Tao;  Li, Jiayi;  Lu, Ning;  Cai, Yinghao;  Wang, Shuo
浏览  |  Adobe PDF(3303Kb)  |  收藏  |  浏览/下载:269/84  |  提交时间:2020/08/27
Deep Reinforcement Learning-Based Automatic Exploration for Navigation in Unknown Environment 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 6, 页码: 2064-2076
作者:  Li, Haoran;  Zhang, Qichao;  Zhao, Dongbin
浏览  |  Adobe PDF(4274Kb)  |  收藏  |  浏览/下载:412/126  |  提交时间:2020/08/03
Robot sensing systems  Navigation  Entropy  Neural networks  Task analysis  Planning  Automatic exploration  deep reinforcement learning (DRL)  optimal decision  partial observation