CASIA OpenIR

Browse/Search results: 23 items in total, items 1-10

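Reinforcement learning-based hovering control of an underwater operation robot propelled by undulating fins  Journal article
Control Theory & Applications, 2022, Volume: 39, Issue: 11, Pages: 2022-2099
Authors:  Ma Ruichen(马睿宸);  Bai Xuejian(白雪剑);  Wang Yu(王宇);  Wang Rui(王睿);  Wang Shuo(王硕)
Adobe PDF(5386Kb)  |  Views/Downloads: 109/44  |  Submitted: 2023/08/02
Underwater operation robot  hovering control  undulating fin  neural network  reinforcement learning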
Dynamic-horizon model-based value estimation with latent imagination  Journal article
IEEE Transactions on Neural Networks and Learning Systems, 2022, Pages: 1-14
Authors:  Wang JJ(王俊杰);  Zhang QC(张启超);  Zhao DB(赵冬斌)
Adobe PDF(2305Kb)  |  Views/Downloads: 156/59  |  Submitted: 2023/05/30
Latent world model  model-based value expansion (MVE)  reinforcement learning
Data-driven adaptive-critic optimal output regulation towards water level control of boiler-turbine systems  Journal article
Expert Systems with Applications, 2022, Pages: 117883
Authors:  Wei Qinglai;  Wang Xin;  Liu Yu;  Xiong Gang
Adobe PDF(2135Kb)  |  Views/Downloads: 146/52  |  Submitted: 2023/05/23
Terrain-Adaptive Longitudinal Control for Autonomous Trucks  Conference paper
, Macau, China, 2022.10.08
Authors:  Xiaoyu Xiong;  Bin Tian;  Rui Zhang;  Yang Sun;  Long Chen
Adobe PDF(2543Kb)  |  Views/Downloads: 85/29  |  Submitted: 2023/05/06
Soft Contrastive Learning with Q-irrelevance Abstraction for Reinforcement Learning  Journal article
IEEE Transactions on Cognitive and Developmental Systems, 2022, DOI: 10.1109/TCDS.2022.3218940
Authors:  Minsong Liu;  Luntong Li;  Shuai Hao;  Yuanheng Zhu;  Dongbin Zhao
Adobe PDF(12013Kb)  |  Views/Downloads: 73/19  |  Submitted: 2023/04/26
Empirical Policy Optimization for n-Player Markov Games  Journal article
IEEE Transactions on Cybernetics, 2022, DOI: 10.1109/TCYB.2022.3179775
Authors:  Yuanheng Zhu;  Weifan Li;  Mengchen Zhao;  Jianye Hao;  Dongbin Zhao
Adobe PDF(1739Kb)  |  Views/Downloads: 95/38  |  Submitted: 2023/04/26
SURRL: Structural Unsupervised Representations for Robot Learning  Journal
Founding date: 2022
Sponsor:  Zhang FY(张丰一), Yurou Chen, Hong Qiao, Zhiyong Liu
Adobe PDF(7817Kb)  |  Views/Downloads: 246/89  |  Submitted: 2023/01/12
Reinforcement learning  structural representations learning  multi-task learning  robotics  
AME: Attention and Memory Enhancement in Hyper-Parameter Optimization  Conference paper
, New Orleans, USA, 2022.6.19-6.24
Authors:  Xu, Nuo;  Chang, Jianlong;  Nie, Xing;  Huo, Chunlei;  Xiang, Shiming;  Pan, Chunhong
Adobe PDF(913Kb)  |  Views/Downloads: 176/42  |  Submitted: 2022/12/20
Monte Carlo-based reinforcement learning control for unmanned aerial vehicle systems  Journal article
NEUROCOMPUTING, 2022, Volume: 507, Pages: 282-291
Authors:  Wei, Qinglai;  Yang, Zesheng;  Su, Huaizhong;  Wang, Lijian
Views/Downloads: 211/0  |  Submitted: 2022/09/19
Reinforcement learning  Adaptive dynamic programming (ADP)  UAV control  Monte Carlo simulation  Neural networks  
HMDRL: Hierarchical Mixed Deep Reinforcement Learning to Balance Vehicle Supply and Demand  Journal article
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, Pages: 12
Authors:  Xi, Jinhao;  Zhu, Fenghua;  Ye, Peijun;  Lv, Yisheng;  Tang, Haina;  Wang, Fei-Yue
Adobe PDF(3316Kb)  |  Views/Downloads: 260/30  |  Submitted: 2022/09/19
deep reinforcement learning  online ride-hailing system  hierarchical repositioning framework  parallel coordination mechanism  mixed state