CASIA OpenIR

浏览/检索结果: 共23条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Multi-Agent Reinforcement Learning Based on Clustering in Two-Player Games 会议论文
, Xiamen, China, 2019-12-6
作者:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(488Kb)  |  收藏  |  浏览/下载:137/44  |  提交时间:2023/06/28
reinforcement learning  unsupervised clustering  matrix game  
Lane change decision-making through deep reinforcement learning with rule-based constraints 会议论文
, Budapest, Hungary, 2019-7
作者:  Wang JJ(王俊杰);  Zhang QC(张启超);  Zhao DB(赵冬斌);  Chen YR(陈亚冉)
Adobe PDF(295Kb)  |  收藏  |  浏览/下载:129/39  |  提交时间:2023/05/30
Lane Change  Decision-making  Deep Reinforcement Learning  Deep Q-Network  
Multitask Policy Adversarial Learning for Human-Level Control With Large State Spaces 期刊论文
IEEE Transactions on Industrial Informatics Information, 2019, 卷号: 15, 期号: 4, 页码: 2395-2404
作者:  Wang JP(王军平);  You Kang Shi;  Wen Sheng Zhang;  Ian Thomas;  Shi Hui Duan
Adobe PDF(2547Kb)  |  收藏  |  浏览/下载:122/41  |  提交时间:2023/05/05
Adaptive Brightness Learning for Active Object Recognition 会议论文
, Brighton, UK, 2019.5.12-5.17
作者:  Xu, Nuo;  Huo, Chunlei;  Pan, Chunhong
Adobe PDF(3936Kb)  |  收藏  |  浏览/下载:49/8  |  提交时间:2022/12/20
Bootstrap Estimated Uncertainty of the Environment Model for Model-Based Reinforcement Learning 会议论文
, Honolulu, Hawaii, USA, 2019-1
作者:  Huang, Wenzhen;  Zhang, Junge;  Huang, Kaiqi
Adobe PDF(5079Kb)  |  收藏  |  浏览/下载:141/46  |  提交时间:2022/01/11
Conservative Policy Gradient in Multi-critic Setting 会议论文
, Hangzhou, China, 2019.11.22-24
作者:  Xi, Bao;  Wang, Rui;  Wang, Shuo;  Lu, Tao;  Cai, Yinghao
浏览  |  Adobe PDF(379Kb)  |  收藏  |  浏览/下载:214/73  |  提交时间:2021/02/02
inconsistancy  stablility  Q learning  policy gradient  
Mixing Update Q-value for Deep Reinforcement Learning 会议论文
, Budapest, Hungary, 2019/7/14-19
作者:  Li Zhunan;  Hou Xinwen
浏览  |  Adobe PDF(468Kb)  |  收藏  |  浏览/下载:183/74  |  提交时间:2020/06/10
Autonomous Navigation with Improved Hierarchical Neural Network Based on Deep Reinforcement Learning 会议论文
, 中国 广州, 2019.07.27-2019.07.30
作者:  Zhang, Haiying;  Qiu, Tenghai;  Li, Shuxiao;  Zhu, Chengfei;  Lan, Xiaosong;  Chang, Hongxing
Adobe PDF(349Kb)  |  收藏  |  浏览/下载:300/96  |  提交时间:2020/06/09
Autonomous Navigation  DDPG  Improved Hierarchical Neural Network  Curriculum Learning  
Deep Reinforcement Learning of Robotic Precision Insertion Skill Accelerated by Demonstrations 会议论文
, Vancouver, British Columbia, Canada, 2019-08-22
作者:  Wu, Xiapeng;  Zhang, Dapeng;  Qin, Fangbo;  Xu, De
浏览  |  Adobe PDF(1748Kb)  |  收藏  |  浏览/下载:276/100  |  提交时间:2020/06/09
无权访问的条目 会议论文
作者:  Xiong,, Fangzhou;  Liu, Zhiyong;  Huang, Kaizhu;  Yang, Xu;  Amir Hussain
Adobe PDF(277Kb)  |  收藏  |  浏览/下载:55/9  |  提交时间:2020/04/26