CASIA OpenIR

Browse/search results: 12 items in total, showing 1-10

Multi-Agent Reinforcement Learning Based on Clustering in Two-Player Games (Conference paper)
Xiamen, China, 2019-12-6
Authors: Li WF(李伟凡); Zhu YH(朱圆恒); Zhao DB(赵冬斌)
Adobe PDF(488Kb) | Views/Downloads: 144/46 | Submitted: 2023/06/28
reinforcement learning  unsupervised clustering  matrix game  
Lane change decision-making through deep reinforcement learning with rule-based constraints (Conference paper)
Budapest, Hungary, 2019-7
Authors: Wang JJ(王俊杰); Zhang QC(张启超); Zhao DB(赵冬斌); Chen YR(陈亚冉)
Adobe PDF(295Kb) | Views/Downloads: 131/40 | Submitted: 2023/05/30
Lane Change  Decision-making  Deep Reinforcement Learning  Deep Q-Network  
Adaptive Brightness Learning for Active Object Recognition (Conference paper)
Brighton, UK, 2019.5.12-5.17
Authors: Xu, Nuo; Huo, Chunlei; Pan, Chunhong
Adobe PDF(3936Kb) | Views/Downloads: 53/9 | Submitted: 2022/12/20
Bootstrap Estimated Uncertainty of the Environment Model for Model-Based Reinforcement Learning (Conference paper)
Honolulu, Hawaii, USA, 2019-1
Authors: Huang, Wenzhen; Zhang, Junge; Huang, Kaiqi
Adobe PDF(5079Kb) | Views/Downloads: 145/48 | Submitted: 2022/01/11
Conservative Policy Gradient in Multi-critic Setting (Conference paper)
Hangzhou, China, 2019.11.22-24
Authors: Xi, Bao; Wang, Rui; Wang, Shuo; Lu, Tao; Cai, Yinghao
Adobe PDF(379Kb) | Views/Downloads: 218/74 | Submitted: 2021/02/02
inconsistency  stability  Q learning  policy gradient  
Mixing Update Q-value for Deep Reinforcement Learning (Conference paper)
Budapest, Hungary, 2019/7/14-19
Authors: Li Zhunan; Hou Xinwen
Adobe PDF(468Kb) | Views/Downloads: 186/75 | Submitted: 2020/06/10
Autonomous Navigation with Improved Hierarchical Neural Network Based on Deep Reinforcement Learning (Conference paper)
Guangzhou, China, 2019.07.27-2019.07.30
Authors: Zhang, Haiying; Qiu, Tenghai; Li, Shuxiao; Zhu, Chengfei; Lan, Xiaosong; Chang, Hongxing
Adobe PDF(349Kb) | Views/Downloads: 305/98 | Submitted: 2020/06/09
Autonomous Navigation  DDPG  Improved Hierarchical Neural Network  Curriculum Learning  
Deep Reinforcement Learning of Robotic Precision Insertion Skill Accelerated by Demonstrations (Conference paper)
Vancouver, British Columbia, Canada, 2019-08-22
Authors: Wu, Xiapeng; Zhang, Dapeng; Qin, Fangbo; Xu, De
Adobe PDF(1748Kb) | Views/Downloads: 278/101 | Submitted: 2020/06/09
Restricted-access item (Conference paper)
Authors: Xiong, Fangzhou; Liu, Zhiyong; Huang, Kaizhu; Yang, Xu; Amir Hussain
Adobe PDF(277Kb) | Views/Downloads: 55/9 | Submitted: 2020/04/26
Learning Deep Decentralized Policy Network by Collective Rewards for Real-Time Combat Game (Conference paper)
Macao, China, August 10-16, 2019
Authors: Peixi Peng; Junliang Xing; Lili Cao; Lisen Mu; Chang Huang
Adobe PDF(762Kb) | Views/Downloads: 345/127 | Submitted: 2019/10/10
Multi-agent Learning  Deep Decentralized Policy Network  Real-time Combat Game