CASIA OpenIR

浏览/检索结果: 共665条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game 会议论文
, 线上, 2020-5
作者:  Liu MS(刘民颂);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(727Kb)  |  收藏  |  浏览/下载:15/7  |  提交时间:2024/07/04
面向多机器人博弈的深度强化学习方法 学位论文
, 2024
作者:  胡光政
Adobe PDF(17740Kb)  |  收藏  |  浏览/下载:21/0  |  提交时间:2024/07/04
多智能体深度强化学习  多机器人博弈  极小极大Q学习  值分解  最大熵  
Online Off-Policy Reinforcement Learning for Optimal Control of Unknown Nonlinear Systems Using Neural Networks 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 页码: 11
作者:  Zhu, Liao;  Wei, Qinglai;  Guo, Ping
收藏  |  浏览/下载:4/0  |  提交时间:2024/07/03
Adaptive dynamic programming  nonlinear systems  online learning  optimal control  reinforcement learning (RL)  
Boosting On-Policy Actor-Critic With Shallow Updates in Critic 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 页码: 10
作者:  Li, Luntong;  Zhu, Yuanheng
收藏  |  浏览/下载:6/0  |  提交时间:2024/07/03
Artificial neural networks  Vectors  Task analysis  Training  Representation learning  Approximation algorithms  Optimization  Actor-critic  deep reinforcement learning (DRL)  proximal policy optimization (PPO)  shallow reinforcement learning (SRL)  
Online Adaptive Dynamic Programming for Optimal Self-Learning Control of VTOL Aircraft Systems With Disturbances 期刊论文
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 卷号: 21, 期号: 1, 页码: 343-352
作者:  Wei, Qinglai;  Yang, Zesheng;  Su, Huaizhong;  Wang, Lijian
收藏  |  浏览/下载:3/0  |  提交时间:2024/07/03
Adaptive dynamic programming (ADP)  VTOL aircraft system  policy iteration  neural network (NN)  optimal control  iterative errors  
Synergetic Learning Neuro-Control for Unknown Affine Nonlinear Systems With Asymptotic Stability Guarantees 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 页码: 11
作者:  Zhu, Liao;  Wei, Qinglai;  Guo, Ping
收藏  |  浏览/下载:4/0  |  提交时间:2024/07/03
Approximate dynamic programming (ADP)  neural network  off-policy  optimal control  reinforcement learning (RL)  
Uncertainty-aware Boundary Attention Network for Real-time Semantic Segmentation 会议论文
, 中国福建厦门, 2023年10月13日
作者:  Zhu YB(朱袁兵);  Zhu BK(朱炳科);  Chen YY(陈盈盈);  Wang JQ(王金桥)
Adobe PDF(2957Kb)  |  收藏  |  浏览/下载:16/8  |  提交时间:2024/06/27
Uncertainty Estimation  Real-time Semantic Segmentation  
自然语言嵌入的深度强化学习探索方法研究 学位论文
, 2024
作者:  郭洲蕊
Adobe PDF(7588Kb)  |  收藏  |  浏览/下载:32/1  |  提交时间:2024/06/26
深度强化学习  自然语言  探索  
数据驱动的可控植物生长环境建模与调控 学位论文
, 2024
作者:  赵晓璇
Adobe PDF(4026Kb)  |  收藏  |  浏览/下载:16/0  |  提交时间:2024/06/25
数据驱动  温室气候模型  环境参数  温室气候调控  深度强化学习  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:14/7  |  提交时间:2024/06/25