CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共6条,第1-6条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
DIMSAN: Fast Exploration with the Synergy between Density-based Intrinsic Motivation and Self-adaptive Action Noise 会议论文
, 西安, 2021.5.30-2021.6.5
作者:  Li, Jiayi;  Li, Boyao;  Lu, Tao;  Lu, Ning;  Cai, Yinghao;  Wang, Shuo
Adobe PDF(5599Kb)  |  收藏  |  浏览/下载:211/41  |  提交时间:2022/06/14
Meta-Imitation Learning by Watching Video Demonstrations 会议论文
, 线上, 2022.4.25-2022.4.29
作者:  Li, Jiayi;  Lu, Tao;  Cao, Xiaoge;  Cai, Yinghao;  Wang, Shuo
Adobe PDF(8968Kb)  |  收藏  |  浏览/下载:244/61  |  提交时间:2022/06/14
Conservative Policy Gradient in Multi-critic Setting 会议论文
, Hangzhou, China, 2019.11.22-24
作者:  Xi, Bao;  Wang, Rui;  Wang, Shuo;  Lu, Tao;  Cai, Yinghao
浏览  |  Adobe PDF(379Kb)  |  收藏  |  浏览/下载:239/83  |  提交时间:2021/02/02
inconsistancy  stablility  Q learning  policy gradient  
ACDER: Augmented Curiosity-Driven Experience Replay 会议论文
, Paris, France, 2020.05.31-2020.08.31
作者:  Li, Boyao;  Lu, Tao;  Li, Jiayi;  Lu, Ning;  Cai, Yinghao;  Wang, Shuo
浏览  |  Adobe PDF(3303Kb)  |  收藏  |  浏览/下载:269/84  |  提交时间:2020/08/27
Curiosity-Driven Exploration for Off-Policy Reinforcement Learning Methods 会议论文
, Dali, China, 2019.12.06-2019.12.08
作者:  Li, Boyao;  Lu, Tao;  Li, Jiayi;  Lu, Ning;  Cai, Yinghao;  Wang, Shuo
浏览  |  Adobe PDF(2877Kb)  |  收藏  |  浏览/下载:216/75  |  提交时间:2020/08/27
An Automatic Robot Skills Learning System from Robot's Real-World Demonstrations 会议论文
, Nanchang, China, 2019.06.03-2019.06.05
作者:  Li, Boyao;  Lu, Tao;  Li, Xiaocan;  Cai, Yinghao;  Wang, Shuo
浏览  |  Adobe PDF(10072Kb)  |  收藏  |  浏览/下载:169/36  |  提交时间:2020/08/27
learn from demonstrations  simulation  real-world demonstrations  coordinate transformation