CASIA OpenIR

浏览/检索结果: 共14条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Learning to Navigate in Human Environments via Deep Reinforcement Learning 会议论文
, Sydney, Australia, 2019-12-12至2019-12-15
作者:  Xingyuan Gao;  Shiying Sun;  Xiaoguang Zhao;  Min Tan
Adobe PDF(1298Kb)  |  收藏  |  浏览/下载:149/44  |  提交时间:2022/03/31
Addressing Reward Engineering for Deep Reinforcement Learning on Multi-stage Task 会议论文
, Australia, 2019-12
作者:  Chen, Bin;  Su, Jianhua
浏览  |  Adobe PDF(1169Kb)  |  收藏  |  浏览/下载:264/76  |  提交时间:2020/06/08
Parallel Adaptive Critic Designs of Optimal Control for Ice-Storage Air Conditioning Systems 会议论文
, Xiamen, China, 2019-12
作者:  Liao, Zehua;  Wei, Qinglai;  Song, Ruizhuo
浏览  |  Adobe PDF(199Kb)  |  收藏  |  浏览/下载:274/75  |  提交时间:2020/06/26
Parallel adaptive critic design  Adaptive dynamic programming  Particle swarm optimization  Ice-storage air conditioning  
Multi-Agent Reinforcement Learning Based on Clustering in Two-Player Games 会议论文
, Xiamen, China, 2019-12-6
作者:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(488Kb)  |  收藏  |  浏览/下载:108/35  |  提交时间:2023/06/28
reinforcement learning  unsupervised clustering  matrix game  
Natural Scene Facial Expression Recognition based on Differential Features 会议论文
, 中国杭州, 2019.11
作者:  Hu, Shenhua;  Hu, Yiming;  Li, Jianquan;  Chen, Yunze;  Chen, Mengjuan;  Gu, Qingyi
浏览  |  Adobe PDF(866Kb)  |  收藏  |  浏览/下载:240/69  |  提交时间:2020/06/10
Facial expression recognition  GAN with attention  Differential feature  
Conservative Policy Gradient in Multi-critic Setting 会议论文
, Hangzhou, China, 2019.11.22-24
作者:  Xi, Bao;  Wang, Rui;  Wang, Shuo;  Lu, Tao;  Cai, Yinghao
浏览  |  Adobe PDF(379Kb)  |  收藏  |  浏览/下载:185/63  |  提交时间:2021/02/02
inconsistancy  stablility  Q learning  policy gradient  
Mixing Update Q-value for Deep Reinforcement Learning 会议论文
, Budapest, Hungary, 2019/7/14-19
作者:  Li Zhunan;  Hou Xinwen
Adobe PDF(468Kb)  |  收藏  |  浏览/下载:158/65  |  提交时间:2020/06/10
Autonomous Navigation with Improved Hierarchical Neural Network Based on Deep Reinforcement Learning 会议论文
, 中国 广州, 2019.07.27-2019.07.30
作者:  Zhang, Haiying;  Qiu, Tenghai;  Li, Shuxiao;  Zhu, Chengfei;  Lan, Xiaosong;  Chang, Hongxing
浏览  |  Adobe PDF(349Kb)  |  收藏  |  浏览/下载:274/90  |  提交时间:2020/06/09
Autonomous Navigation  DDPG  Improved Hierarchical Neural Network  Curriculum Learning  
基于强化学习和非正交多址接入的车联网无线资源分配 会议论文
, 杭州, 中国, 11月22-24日
作者:  韩双双;  李卓珩;  杨林瑶;  王晓
浏览  |  Adobe PDF(2801Kb)  |  收藏  |  浏览/下载:278/64  |  提交时间:2020/03/18
Bootstrap Estimated Uncertainty of the Environment Model for Model-Based Reinforcement Learning 会议论文
, Honolulu, Hawaii, USA, 2019-1
作者:  Huang, Wenzhen;  Zhang, Junge;  Huang, Kaiqi
Adobe PDF(5079Kb)  |  收藏  |  浏览/下载:126/41  |  提交时间:2022/01/11