CASIA OpenIR
(This search is based on the user's claimed works)

Browse/search results: 28 items in total, showing items 1-10

Prototypical context-aware dynamics generalization for high-dimensional model-based reinforcement learning (Conference paper)
, Kigali City, Rwanda, Africa, 2023-5-5
Authors:  Junjie, Wang;  Yao, Mu;  Dong, Li;  Qichao, Zhang;  Dongbin, Zhao;  Yuzheng, Zhuang;  Ping, Luo;  Bin, Wang;  Jianye, Hao
Adobe PDF(3492Kb)  |  Views/Downloads: 133/38  |  Submitted: 2023/06/29
Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning (Conference paper)
, Queensland, 2023-6
Authors:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(4104Kb)  |  Views/Downloads: 221/72  |  Submitted: 2023/06/29
multi-agent  reinforcement learning  policy gradient  
Multi-Agent Reinforcement Learning Based on Clustering in Two-Player Games (Conference paper)
, Xiamen, China, 2019-12-6
Authors:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(488Kb)  |  Views/Downloads: 120/39  |  Submitted: 2023/06/28
reinforcement learning  unsupervised clustering  matrix game  
Multi-Objective Neural Architecture Search for Light-Weight Model (Conference paper)
, Hangzhou, China, 22-24 November 2019
Authors:  Nannan Li;  Yaran Chen;  Zixiang Ding;  Dongbin Zhao;  Zhonghua Pang;  Ruisheng Qin
Adobe PDF(430Kb)  |  Views/Downloads: 131/43  |  Submitted: 2023/06/27
Neural architecture search  light-weight  multi-objective  reinforcement learning  image classification  
Benchmarking lane-changing decision-making for deep reinforcement learning (Conference paper)
, Guangzhou, China, 2021-11
Authors:  Wang JJ(王俊杰);  Zhang QC(张启超);  Zhao DB(赵冬斌)
Adobe PDF(1117Kb)  |  Views/Downloads: 121/42  |  Submitted: 2023/05/30
Optimal Pedestrian Evacuation in Building with Consecutive Differential Dynamic Programming (Conference paper)
, Budapest, Hungary, 2019-7-14
Authors:  Zhu YH(朱圆恒);  Haibo He;  Dongbin Zhao;  Zhongsheng Hou
Adobe PDF(679Kb)  |  Views/Downloads: 61/30  |  Submitted: 2023/05/22
A Review of Computational Intelligence for StarCraft AI (Conference paper)
, Bangalore, India, 18-21 Nov. 2018
Authors:  Tang, Zhentao;  Shao, Kun;  Zhu, Yuanheng;  Li, Dong;  Zhao, Dongbin;  Huang, Tingwen
Adobe PDF(131Kb)  |  Views/Downloads: 492/225  |  Submitted: 2019/04/25
An Autonomous Driving Experience Platform with Learning-Based Functions (Conference paper)
, Bangalore, India, 18-21 Nov. 2018
Authors:  Li, Dong;  Zhao, Dongbin;  Zhang, Qichao;  Zhu, Yuanheng
Adobe PDF(215Kb)  |  Views/Downloads: 277/71  |  Submitted: 2019/04/25
Event-triggered H∞ control for continuous-time nonlinear system (Conference paper)
, *, 2015
Authors:  Zhao, Dongbin(赵冬斌);  Zhang, Qichao;  Li, Xiangjun;  Kong, Lingda
Adobe PDF(365Kb)  |  Views/Downloads: 238/85  |  Submitted: 2018/01/04
Deep reinforcement learning with Experience Replay based on SARSA (Conference paper)
, *, 2016-9
Authors:  Zhao, Dongbin(赵冬斌);  Wang, Haitao;  Shao, Kun;  Zhu, Yuanheng
Adobe PDF(1288Kb)  |  Views/Downloads: 394/174  |  Submitted: 2018/01/04
Deep Learning  Reinforcement Learning  Experience Replay  Q Learning  Sarsa Learning