CASIA OpenIR

Browse/search results: 13 items in total, showing items 1-10

Optimal Pedestrian Evacuation in Building with Consecutive Differential Dynamic Programming (Conference paper)
Budapest, Hungary, 2019-7-14
Authors: Zhu YH(朱圆恒); Haibo He; Dongbin Zhao; Zhongsheng Hou
Adobe PDF (679 KB) | Views/Downloads: 63/31 | Submitted: 2023/05/22
Time-sequence Action-Decision and Navigation Through Stage Deep Reinforcement Learning in Complex Dynamic Environments (Conference paper)
Xiamen, 2019.12
Authors: Huimu, Wang; Tenghai, Qiu; Zhen, Liu; Zhiqiang, Pu; Jianqiang, Yi; Zhaoyang, Liu
Adobe PDF (2178 KB) | Views/Downloads: 182/52 | Submitted: 2021/06/24
Conservative Policy Gradient in Multi-critic Setting (Conference paper)
Hangzhou, China, 2019.11.22-24
Authors: Xi, Bao; Wang, Rui; Wang, Shuo; Lu, Tao; Cai, Yinghao
Adobe PDF (379 KB) | Views/Downloads: 208/71 | Submitted: 2021/02/02
inconsistency  stability  Q learning  policy gradient  
Parallel Adaptive Critic Designs of Optimal Control for Ice-Storage Air Conditioning Systems (Conference paper)
Xiamen, China, 2019-12
Authors: Liao, Zehua; Wei, Qinglai; Song, Ruizhuo
Adobe PDF (199 KB) | Views/Downloads: 291/81 | Submitted: 2020/06/26
Parallel adaptive critic design  Adaptive dynamic programming  Particle swarm optimization  Ice-storage air conditioning  
Mixing Update Q-value for Deep Reinforcement Learning (Conference paper)
Budapest, Hungary, 2019/7/14-19
Authors: Li Zhunan; Hou Xinwen
Adobe PDF (468 KB) | Views/Downloads: 179/73 | Submitted: 2020/06/10
Connecting Model-Based and Model-Free Control With Emotion Modulation in Learning Systems (Journal article)
IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2019, Volume: 10, Issue: 4, Pages: 1-15
Authors: Huang, Xiao; Wu, Wei; Qiao, Hong
Adobe PDF (1614 KB) | Views/Downloads: 272/90 | Submitted: 2020/06/09
Brain-inspired computing  decision-making  emotion modulation  emotion-cognition interactions  reinforcement learning  
Deep Reinforcement Learning of Robotic Precision Insertion Skill Accelerated by Demonstrations (Conference paper)
Vancouver, British Columbia, Canada, 2019-08-22
Authors: Wu, Xiapeng; Zhang, Dapeng; Qin, Fangbo; Xu, De
Adobe PDF (1748 KB) | Views/Downloads: 272/100 | Submitted: 2020/06/09
Robust Visual Detection and Tracking Strategies for Autonomous Aerial Refueling of UAVs (Journal article)
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2019, Volume: 68, Issue: 12, Pages: 4640-4652
Authors: Sun, Siyang; Yin, Yingjie; Wang, Xingang; Xu, De
Adobe PDF (11285 KB) | Views/Downloads: 289/59 | Submitted: 2020/03/30
Target tracking  Feature extraction  Object detection  Search problems  Proposals  Visualization  Detectors  Autonomous aerial refueling (AAR)  deep learning  object detection  object tracking  reinforcement learning  
Adaptive Tracking Control of Surface Vessel Using Optimized Backstepping Technique (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2019, Volume: 49, Issue: 9, Pages: 3420-3431
Authors: Wen, Guoxing; Ge, Shuzhi Sam; Chen, C. L. Philip; Tu, Fangwen; Wang, Shengnan
Views/Downloads: 191/0 | Submitted: 2019/12/16
Actor-critic architecture  Lyapunov stability  optimized backstepping (OB)  reinforcement learning (RL)  surface vessel  
Output Tracking Control Based on Adaptive Dynamic Programming With Multistep Policy Evaluation (Journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2019, Volume: 49, Issue: 10, Pages: 2155-2165
Authors: Luo, Biao; Liu, Derong; Huang, Tingwen; Liu, Jiangjiang
Views/Downloads: 237/0 | Submitted: 2019/12/16
Adaptive dynamic programming (ADP)  Bellman equation  heuristic dynamic programming  neural networks (NNs)  output tracking control