CASIA OpenIR

Browse/search results: 41 items in total, showing items 1-10

Lane change decision-making through deep reinforcement learning with rule-based constraints (Conference paper)
, Budapest, Hungary, 2019-7
Authors: Wang JJ(王俊杰); Zhang QC(张启超); Zhao DB(赵冬斌); Chen YR(陈亚冉)
Adobe PDF(295Kb)  |  Views/Downloads: 129/39  |  Submitted: 2023/05/30
Lane Change  Decision-making  Deep Reinforcement Learning  Deep Q-Network  
Optimal Pedestrian Evacuation in Building with Consecutive Differential Dynamic Programming (Conference paper)
, Budapest, Hungary, 2019-7-14
Authors: Zhu YH(朱圆恒); Haibo He; Dongbin Zhao; Zhongsheng Hou
Adobe PDF(679Kb)  |  Views/Downloads: 65/32  |  Submitted: 2023/05/22
Multitask Policy Adversarial Learning for Human-Level Control With Large State Spaces (Journal article)
IEEE Transactions on Industrial Informatics, 2019, Volume: 15, Issue: 4, Pages: 2395-2404
Authors: Wang JP(王军平); You Kang Shi; Wen Sheng Zhang; Ian Thomas; Shi Hui Duan
Adobe PDF(2547Kb)  |  Views/Downloads: 122/41  |  Submitted: 2023/05/05
Vision-based control in the open racing car simulator with deep and reinforcement learning (Journal article)
Journal of Ambient Intelligence and Humanized Computing, 2019, DOI: 10.1007/s12652-019-01503-y
Authors: Yuanheng Zhu; Dongbin Zhao
Adobe PDF(2210Kb)  |  Views/Downloads: 53/13  |  Submitted: 2023/04/26
Attention-Aware Sampling via Deep Reinforcement Learning for Action Recognition (Conference paper)
, Hilton Hawaiian Village, Honolulu, Hawaii, USA, January 27 – February 1, 2019
Authors: Dong, Wenkai; Zhang, Zhaoxiang; Tan, Tieniu
Adobe PDF(506Kb)  |  Views/Downloads: 198/66  |  Submitted: 2022/06/14
Learning to Navigate in Human Environments via Deep Reinforcement Learning (Conference paper)
, Sydney, Australia, 2019-12-12 to 2019-12-15
Authors: Xingyuan Gao; Shiying Sun; Xiaoguang Zhao; Min Tan
Adobe PDF(1298Kb)  |  Views/Downloads: 223/56  |  Submitted: 2022/03/31
Bootstrap Estimated Uncertainty of the Environment Model for Model-Based Reinforcement Learning (Conference paper)
, Honolulu, Hawaii, USA, 2019-1
Authors: Huang, Wenzhen; Zhang, Junge; Huang, Kaiqi
Adobe PDF(5079Kb)  |  Views/Downloads: 141/46  |  Submitted: 2022/01/11
Augmented Visual-semantic Embeddings for Image and Sentence Matching (Conference paper)
, Taipei, 2019.9.22-2019.9.25
Authors: Chen, Zerui; Huang, Yan; Wang, Liang
Adobe PDF(503Kb)  |  Views/Downloads: 181/62  |  Submitted: 2021/06/03
Management of Control Impacts Based on Maximizing the Spread of Influence (Journal article)
International Journal of Automation and Computing, 2019, Volume: 16, Issue: 3, Pages: 341-353
Authors: Alexander Tselykh; Vladislav Vasilev; Larisa Tselykh
Adobe PDF(1202Kb)  |  Views/Downloads: 130/48  |  Submitted: 2021/02/22
Directed weighted graphs  control impact  spread of influence  optimization algorithm  growth model.  
Conservative Policy Gradient in Multi-critic Setting (Conference paper)
, Hangzhou, China, 2019.11.22-24
Authors: Xi, Bao; Wang, Rui; Wang, Shuo; Lu, Tao; Cai, Yinghao
Adobe PDF(379Kb)  |  Views/Downloads: 214/73  |  Submitted: 2021/02/02
inconsistency  stability  Q learning  policy gradient