CASIA OpenIR

浏览/检索结果: 共363条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
A Novel Parallel Control Method for Optimal Consensus of Nonlinear Multiagent Systems 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2024, 页码: 14
作者:  Jiao, Shanshan;  Wei, Qinglai;  Wang, Fei-Yue
收藏  |  浏览/下载:1/0  |  提交时间:2024/07/22
Consensus control  Optimal control  Performance analysis  Vectors  Heuristic algorithms  Multi-agent systems  Dynamic programming  Adaptive dynamic programming (ADP)  coupled Hamilton-Jacobi  optimal consensus control  parallel control  
An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game 会议论文
, 线上, 2020-5
作者:  Liu MS(刘民颂);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(727Kb)  |  收藏  |  浏览/下载:22/10  |  提交时间:2024/07/04
Online Off-Policy Reinforcement Learning for Optimal Control of Unknown Nonlinear Systems Using Neural Networks 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 页码: 11
作者:  Zhu, Liao;  Wei, Qinglai;  Guo, Ping
收藏  |  浏览/下载:7/0  |  提交时间:2024/07/03
Adaptive dynamic programming  nonlinear systems  online learning  optimal control  reinforcement learning (RL)  
Boosting On-Policy Actor-Critic With Shallow Updates in Critic 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 页码: 10
作者:  Li, Luntong;  Zhu, Yuanheng
收藏  |  浏览/下载:8/0  |  提交时间:2024/07/03
Artificial neural networks  Vectors  Task analysis  Training  Representation learning  Approximation algorithms  Optimization  Actor-critic  deep reinforcement learning (DRL)  proximal policy optimization (PPO)  shallow reinforcement learning (SRL)  
Online Adaptive Dynamic Programming for Optimal Self-Learning Control of VTOL Aircraft Systems With Disturbances 期刊论文
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 卷号: 21, 期号: 1, 页码: 343-352
作者:  Wei, Qinglai;  Yang, Zesheng;  Su, Huaizhong;  Wang, Lijian
收藏  |  浏览/下载:7/0  |  提交时间:2024/07/03
Adaptive dynamic programming (ADP)  VTOL aircraft system  policy iteration  neural network (NN)  optimal control  iterative errors  
Synergetic Learning Neuro-Control for Unknown Affine Nonlinear Systems With Asymptotic Stability Guarantees 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 页码: 11
作者:  Zhu, Liao;  Wei, Qinglai;  Guo, Ping
收藏  |  浏览/下载:9/0  |  提交时间:2024/07/03
Approximate dynamic programming (ADP)  neural network  off-policy  optimal control  reinforcement learning (RL)  
User Response Modeling in Reinforcement Learning for Ads Allocation 会议论文
, 新加坡, May 13 - 17, 2024
作者:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  收藏  |  浏览/下载:35/15  |  提交时间:2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
Soft Contrastive Learning with Q-irrelevance Abstraction for Reinforcement Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2023, 卷号: 15, 期号: 3, 页码: 1463 - 1473
作者:  Liu MS(刘民颂);  Li LT(李伦通);  Hao S(郝帅);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(4197Kb)  |  收藏  |  浏览/下载:43/15  |  提交时间:2024/06/24
Enhancing Reinforcement Learning via Transformer-based State Predictive Representations 期刊论文
IEEE Transactions on Artificial Intelligence, 2024, 页码: 1 - 12
作者:  Liu MS(刘民颂);  Zhu YH(朱圆恒);  Chen YR(陈亚冉);  Zhao DB(赵冬斌)
Adobe PDF(1162Kb)  |  收藏  |  浏览/下载:29/8  |  提交时间:2024/06/24
Modeling Socially Normative Navigation Behaviors from Demonstrations with Inverse Reinforcement Learning 会议论文
, Vancouver, British Columbia, Canada, 2019-08-22至2019-08-26
作者:  Xingyuan Gao;  Xiaoguang Zhao;  Min Tan
Adobe PDF(1500Kb)  |  收藏  |  浏览/下载:32/14  |  提交时间:2024/06/21