CASIA OpenIR

Browse/Search results: 28 items in total, showing items 1-10

Potential Driven Reinforcement Learning for Hard Exploration Tasks (conference paper)
Online, 2020-4
Authors:  Zhao EM(赵恩民);  Deng SH(邓诗弘);  Zang YF(臧一凡);  Kang YX(康永欣);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF (1999 KB)  |  Views/Downloads: 86/31  |  Submitted: 2023/06/29
Stable Training of Bellman Error in Reinforcement Learning (conference paper)
Thailand, November 18–22
Authors:  Gong C(龚晨);  Bai YP(白云鹏);  Hou XW(侯新文);  Ji XH(季晓慧)
Adobe PDF (2416 KB)  |  Views/Downloads: 105/31  |  Submitted: 2023/06/27
Efficient cooperative structured control for a multi-joint biomimetic robotic fish (journal article)
IEEE/ASME Transactions on Mechatronics, 2020, Volume: 26, Issue: 5, Pages: 2506-2516
Authors:  Yan Shuaizheng;  Wu Zhengxing;  Wang Jian;  Tan Min;  Yu Junzhi
Adobe PDF (2394 KB)  |  Views/Downloads: 85/32  |  Submitted: 2023/05/31
Wd3: Taming the estimation bias in deep reinforcement learning (conference paper)
Baltimore, MD, USA, 2020-12
Authors:  He Q(何强);  Hou XW(侯新文)
Adobe PDF (2006 KB)  |  Views/Downloads: 204/39  |  Submitted: 2022/06/27
deep reinforcement learning  estimation bias  neural networks  
An IoT Edge Computing System Architecture and its Application (conference paper)
Nanjing, China, Oct. 30 – Nov. 2, 2020
Authors:  Shichao Chen;  Qijie Li;  Hua Zhang;  fenghua.zhu@ia.ac.cn;  Gang Xiong;  Ying Tang
Adobe PDF (4370 KB)  |  Views/Downloads: 216/75  |  Submitted: 2022/04/08
IoT  Edge computing  Energy monitoring and optimization  
Face Anti-Spoofing by Learning Polarization Cues in a Real-World Scenario (conference paper)
Chengdu, China, November 13 - 15, 2020
Authors:  Tian, Yu;  Zhang, Kunbo;  Wang, Leyuan;  Sun, Zhenan
Adobe PDF (3838 KB)  |  Views/Downloads: 209/43  |  Submitted: 2021/10/08
Distill and Replay for Continual Language Learning (conference paper)
Barcelona, Spain (Online), 2020-12-8
Authors:  Sun, Jingyuan;  Wang, Shaonan;  Zhang, Jiajun;  Zong, Chengqing
Adobe PDF (769 KB)  |  Views/Downloads: 203/57  |  Submitted: 2021/06/28
Path Planning for Intelligent Robots Based on Deep Q-learning With Experience Replay and Heuristic Knowledge (journal article)
IEEE/CAA Journal of Automatica Sinica, 2020, Volume: 7, Issue: 4, Pages: 1179-1189
Authors:  Lan Jiang;  Hongyun Huang;  Zuohua Ding
Adobe PDF (1955 KB)  |  Views/Downloads: 103/43  |  Submitted: 2021/03/11
Deep Q-learning (DQL)  experience replay (ER)  heuristic knowledge (HK)  path planning  
Approximate Dynamic Programming for Stochastic Resource Allocation Problems (journal article)
IEEE/CAA Journal of Automatica Sinica, 2020, Volume: 7, Issue: 4, Pages: 975-990
Authors:  Ali Forootani;  Raffaele Iervolino;  Massimo Tipaldi;  Joshua Neilson
Adobe PDF (3558 KB)  |  Views/Downloads: 124/36  |  Submitted: 2021/03/11
Approximate dynamic programming (ADP)  dynamic programming (DP)  Markov decision processes (MDPs)  resource allocation problem  
Environmental Adaptive Control of a Snake-like Robot With Variable Stiffness Actuators (journal article)
IEEE/CAA Journal of Automatica Sinica, 2020, Volume: 7, Issue: 3, Pages: 745-751
Authors:  Dong Zhang;  Hao Yuan;  Zhengcai Cao
Adobe PDF (10652 KB)  |  Views/Downloads: 160/37  |  Submitted: 2021/03/11
Adaptive control  snake-like robot  variable stiffness