CASIA OpenIR

浏览/检索结果: 共41条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Multi-Agent Reinforcement Learning Based on Clustering in Two-Player Games 会议论文
, Xiamen, China, 2019-12-6
作者:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(488Kb)  |  收藏  |  浏览/下载:103/34  |  提交时间:2023/06/28
reinforcement learning  unsupervised clustering  matrix game  
Vision-based control in the open racing car simulator with deep and reinforcement learning 期刊论文
Journal of Ambient Intelligence and Humanized Computing, 2019, 页码: doi={10.1007/s12652-019-01503-y}
作者:  Yuanheng Zhu;  Dongbin Zhao
Adobe PDF(2210Kb)  |  收藏  |  浏览/下载:39/8  |  提交时间:2023/04/26
Learning to Navigate in Human Environments via Deep Reinforcement Learning 会议论文
, Sydney, Australia, 2019-12-12至2019-12-15
作者:  Xingyuan Gao;  Shiying Sun;  Xiaoguang Zhao;  Min Tan
Adobe PDF(1298Kb)  |  收藏  |  浏览/下载:144/42  |  提交时间:2022/03/31
Bootstrap Estimated Uncertainty of the Environment Model for Model-Based Reinforcement Learning 会议论文
, Honolulu, Hawaii, USA, 2019-1
作者:  Huang, Wenzhen;  Zhang, Junge;  Huang, Kaiqi
Adobe PDF(5079Kb)  |  收藏  |  浏览/下载:124/40  |  提交时间:2022/01/11
MSCap: Multi-Style Image Captioning with Unpaired Stylized Text 会议论文
, 美国长滩, 2019.06.16
作者:  Longteng, Guo;  Jing, Liu;  Peng, Yao;  Jiangwei, Li;  Hanqing, Lu
Adobe PDF(914Kb)  |  收藏  |  浏览/下载:109/19  |  提交时间:2021/06/25
Time-sequence Action-Decision and Navigation Through Stage Deep Reinforcement Learning in Complex Dynamic Environments 会议论文
, 厦门, 2019.12
作者:  Huimu, Wang;  Tenghai, Qiu;  Zhen, Liu;  Zhiqiang, Pu;  Jianqiang, Yi;  Zhaoyang, Liu
Adobe PDF(2178Kb)  |  收藏  |  浏览/下载:150/42  |  提交时间:2021/06/24
基于渐进式关系学习的群体行为识别模型及其训练方法 专利
专利类型: 发明专利, 专利号: 201910798505.X, 申请日期: 2019-08-27,
发明人:  胡古月;  余山;  崔波;  何媛
Adobe PDF(1041Kb)  |  收藏  |  浏览/下载:112/0  |  提交时间:2021/05/29
Performance Evaluation and Improvement of Chipset Assembly & Test Production Line Based on Variability 期刊论文
International Journal of Automation and Computing, 2019, 卷号: 16, 期号: 2, 页码: 186-198
作者:  Chang-Jun Li;  Zong-Shi Xie;  Xin-Ran Peng;  Bo Li
浏览  |  Adobe PDF(1239Kb)  |  收藏  |  浏览/下载:116/38  |  提交时间:2021/02/22
Performance evaluation and improvement  chipset assembly & test production line (CATPL)  parameters  Little′s law  variability.  
Conservative Policy Gradient in Multi-critic Setting 会议论文
, Hangzhou, China, 2019.11.22-24
作者:  Xi, Bao;  Wang, Rui;  Wang, Shuo;  Lu, Tao;  Cai, Yinghao
浏览  |  Adobe PDF(379Kb)  |  收藏  |  浏览/下载:185/63  |  提交时间:2021/02/02
inconsistancy  stablility  Q learning  policy gradient  
Parallel Adaptive Critic Designs of Optimal Control for Ice-Storage Air Conditioning Systems 会议论文
, Xiamen, China, 2019-12
作者:  Liao, Zehua;  Wei, Qinglai;  Song, Ruizhuo
浏览  |  Adobe PDF(199Kb)  |  收藏  |  浏览/下载:270/73  |  提交时间:2020/06/26
Parallel adaptive critic design  Adaptive dynamic programming  Particle swarm optimization  Ice-storage air conditioning