CASIA OpenIR
(This search is based on users' claimed works)

Browse/Search results: 24 items in total, items 1-10

An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game (Conference paper)
, Online, 2020-5
Authors:  Liu MS(刘民颂);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(727Kb)  |  Views/Downloads: 15/7  |  Submitted: 2024/07/04
Prototypical context-aware dynamics generalization for high-dimensional model-based reinforcement learning (Conference paper)
, Kigali City, Rwanda, Africa, 2023-5-5
Authors:  Junjie, Wang;  Yao, Mu;  Dong, Li;  Qichao, Zhang;  Dongbin, Zhao;  Yuzheng, Zhuang;  Ping, Luo;  Bin, Wang;  Jianye, Hao
Adobe PDF(3492Kb)  |  Views/Downloads: 162/43  |  Submitted: 2023/06/29
Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning (Conference paper)
, Queensland, 2023-6
Authors:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(4104Kb)  |  Views/Downloads: 248/80  |  Submitted: 2023/06/29
multi-agent  reinforcement learning  policy gradient  
Multi-Agent Reinforcement Learning Based on Clustering in Two-Player Games (Conference paper)
, Xiamen, China, 2019-12-6
Authors:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(488Kb)  |  Views/Downloads: 153/48  |  Submitted: 2023/06/28
reinforcement learning  unsupervised clustering  matrix game  
Shift-Invariant Convolutional Network Search (Conference paper)
, Glasgow, United Kingdom, 19-24 July
Authors:  Nannan Li;  Yaran Chen;  Zixiang Ding;  Dongbin Zhao
Adobe PDF(1067Kb)  |  Views/Downloads: 150/41  |  Submitted: 2020/10/20
Neural architecture search, shift-invariant, multi-objective, low-pass filter, image classification  
Event-triggered H∞ control for continuous-time nonlinear system (Conference paper)
, *, 2015
Authors:  Zhao,Dongbin(赵冬斌);  Zhang,Qichao;  Li,Xiangjun;  Kong,Lingda
Adobe PDF(365Kb)  |  Views/Downloads: 263/90  |  Submitted: 2018/01/04
Deep reinforcement learning with Experience Replay based on SARSA (Conference paper)
, *, 2016-9
Authors:  Zhao,Dongbin(赵冬斌);  Wang,Haitao;  Shao,Kun;  Zhu,Yuanheng
Adobe PDF(1288Kb)  |  Views/Downloads: 412/180  |  Submitted: 2018/01/04
Deep Learning  Reinforcement Learning  Experience Replay  q Learning  Sarsa Learning  
Comparison of methods to efficient graph SLAM under general optimization framework (Conference paper)
YAC 2017
Authors:  Haoran Li;  Qichao Zhang;  Dongbin Zhao
Adobe PDF(151Kb)  |  Views/Downloads: 937/529  |  Submitted: 2017/12/31
Optimization  Slam  Pose Graph  
Thermal Comfort Control Based on MEC Algorithm for HVAC System (Conference paper)
, Killarney, Ireland, 12-17 July 2015
Authors:  Li, Dong;  Zhao, Dongbin;  Zhu, Yuanheng;  Xia, Zhongpu
Adobe PDF(895Kb)  |  Views/Downloads: 200/80  |  Submitted: 2017/12/28
Event-Triggered H∞ Control for Continuous-Time Nonlinear System (Conference paper)
, Jeju, South Korea, October 15-18
Authors:  Zhao,Dongbin;  Zhang,Qichao;  Li,Xiangjun;  Kong,Lingda
Adobe PDF(365Kb)  |  Views/Downloads: 203/53  |  Submitted: 2017/12/28