CASIA OpenIR

浏览/检索结果: 共35条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game 会议论文
, 线上, 2020-5
作者:  Liu MS(刘民颂);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(727Kb)  |  收藏  |  浏览/下载:12/6  |  提交时间:2024/07/04
Recursive Least-Squares Estimator-Aided Online Learning for Visual Tracking 会议论文
, Virtual, United States, 2020-06-14至2020-06-19
作者:  Gao, Jin;  Hu, Weiming;  Lu, Yan
Adobe PDF(468Kb)  |  收藏  |  浏览/下载:38/13  |  提交时间:2024/06/21
Consensus Control of Multi-Agent Systems With Two-Way Switching Directed Topology 会议论文
, 北京, 2020-12-5
作者:  Wang Xin;  Wei Qinglai;  Song Ruizhuo
Adobe PDF(898Kb)  |  收藏  |  浏览/下载:98/40  |  提交时间:2023/06/28
Stable Training of Bellman Error in Reinforcement Learning 会议论文
, Thailand, November 18–22
作者:  Gong C(龚晨);  Bai YP(白云鹏);  Hou XW(侯新文);  Ji XH(季晓慧)
Adobe PDF(2416Kb)  |  收藏  |  浏览/下载:125/34  |  提交时间:2023/06/27
Wd3: Taming the estimation bias in deep reinforcement learning 会议论文
, Baltimore, MD, USA, 2020-12
作者:  He Q(何强);  Hou XW(侯新文)
Adobe PDF(2006Kb)  |  收藏  |  浏览/下载:228/45  |  提交时间:2022/06/27
deep reinforcement learning  estimation bias  neural networks  
Clas-Maze: An Edutainment Tool Combining Tangible Programming and Living Knowledge 会议论文
, 线上会议, 2020年11月10日
作者:  Xing Q(邢倩);  Wang DL(王丹力);  Zhao YY(赵燕艳);  Wang XY(王雪钰)
Adobe PDF(1195Kb)  |  收藏  |  浏览/下载:147/35  |  提交时间:2022/06/17
Pedestrian Choice Modeling and Simulation of Staged Evacuation Strategies in Daya Bay Nuclear Power Plant 期刊论文
EEE Transactions on Computational Social Systems, 2020, 卷号: 7, 期号: 3, 页码: 686-695
作者:  Yang, Linyao;  Wang, Xiao;  Zhang, Jun Jason;  Zhou, Min;  Wang, Fei-Yue
Adobe PDF(2409Kb)  |  收藏  |  浏览/下载:170/52  |  提交时间:2022/06/15
Agent-based modeling  exit choice  pedestrian crowd evacuation  random forest (RF)  staged evacuation  
BNAS: Efficient Neural Architecture Search Using Broad Scalable Architecture 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 期号: 0, 页码: 0
作者:  Ding ZX(丁子祥);  Yaran, Chen;  Nannan, Li;  Dingbin, Zhao;  Zhiquan, Sun;  C. L. Philip Chen
Adobe PDF(2713Kb)  |  收藏  |  浏览/下载:194/45  |  提交时间:2022/01/06
Broad convolutional neural network (BCNN), image classification, neural architecture search (NAS), reinforcement learning (RL)  
Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 12, 页码: 5245-5256
作者:  Wei, Qinglai;  Wang, Lingxiao;  Liu, Yu;  Polycarpou, Marios M.
Adobe PDF(4019Kb)  |  收藏  |  浏览/下载:371/84  |  提交时间:2021/03/08
Elevators  Optimal control  Backpropagation  Machine learning  Neural networks  Learning (artificial intelligence)  Actor  –critic  adaptive dynamic programming  deep learning (DL)  elevator group control (EGC)  optimal control  reinforcement learning (RL)  
Continuous-Time Time-Varying Policy Iteration 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 卷号: 50, 期号: 12, 页码: 4958-4971
作者:  Wei, Qinglai;  Liao, Zehua;  Yang, Zhanyu;  Li, Benkai;  Liu, Derong
Adobe PDF(3149Kb)  |  收藏  |  浏览/下载:294/54  |  提交时间:2021/03/02
Optimal control  Nonlinear systems  Time-varying systems  Mathematical model  Dynamic programming  Approximation algorithms  Iterative algorithms  Adaptive critic designs  adaptive dynamic programming (ADP)  neuro-dynamic programming  nonlinear systems  optimal control  policy iteration