CASIA OpenIR

浏览/检索结果: 共12条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Multistep Look-Ahead Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems With Isoperimetric Constraints 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 页码: 13
作者:  Li, Tao;  Wei, Qinglai;  Wang, Fei-Yue
收藏  |  浏览/下载:46/0  |  提交时间:2024/02/22
Performance analysis  Optimal control  Dynamic programming  Iterative algorithms  Upper bound  Measurement  Convergence  Adaptive dynamic programming (ADP)  isoperimetric constraints  nonlinear systems  optimal control  policy iteration  
A Parallel Control Method For Zero-Sum Games With Unknown Time-varying System 期刊论文
The International Journal of Intelligent Control and Systems, 2023, 页码: 5页
作者:  Qinglai Wei;  Zhenhua Zhu;  Jie Zhang;  Feiyue Wang
Adobe PDF(470Kb)  |  收藏  |  浏览/下载:125/49  |  提交时间:2023/12/15
二人零和动态博弈的自学习平行控制方法研究 学位论文
, 2023
作者:  朱振华
Adobe PDF(1737Kb)  |  收藏  |  浏览/下载:136/6  |  提交时间:2023/12/15
自适应动态规划  平行控制  零和博弈  
Continuous-Time Stochastic Policy Iteration of Adaptive Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 页码: 13
作者:  Wei, Qinglai;  Zhou, Tianmin;  Lu, Jingwei;  Liu, Yu;  Su, Shuai;  Xiao, Jun
收藏  |  浏览/下载:118/0  |  提交时间:2023/11/17
Adaptive dynamic programming (ADP)  Hamilton-Jacobi-Bellman equation (HJBE)  nonlinear stochastic system  stochastic policy iteration (PI)  
Data-Driven Optimal Output Cluster Synchronization Control of Heterogeneous Multi-Agent Systems 期刊论文
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, 页码: 11
作者:  Li, Hongyang;  Wei, Qinglai
收藏  |  浏览/下载:54/0  |  提交时间:2023/11/17
Index Terms- Output cluster synchronization control  data-driven control  adaptive dynamic programming  policy iteration  heterogeneous multi-agent systems  optimal control  
Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning 会议论文
, 昆士兰, 2023-6
作者:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(4104Kb)  |  收藏  |  浏览/下载:218/71  |  提交时间:2023/06/29
multi-agent  reinforcement learning  policy gradient  
Curiosity-Driven and Victim-Aware Adversarial Policies 会议论文
, Austin TX, USA, December 5-9, 2022
作者:  Gong C(龚晨);  Yang Z(杨洲);  Bai YP(白云鹏);  Shi JK(史杰克);  Sinha Arunesh;  Xu BW(徐博文);  Lo David;  Hou XW(侯新文);  Fan GL(范国梁)
Adobe PDF(4090Kb)  |  收藏  |  浏览/下载:111/46  |  提交时间:2023/06/27
基于深度强化学习的超车换道决策方法 学位论文
, 2023
作者:  王俊杰
Adobe PDF(17475Kb)  |  收藏  |  浏览/下载:155/3  |  提交时间:2023/06/26
深度强化学习,自动驾驶,换道决策,基于模型值扩展,动力学泛化  
Optimal Strategy for Aircraft Pursuit-Evasion Games via Self-Play Iteration 期刊论文
Machine Intelligence Research, 2023, 页码: 1-12
作者:  Wang Xin;  Wei Qinglai;  Li Tao;  Zhang Jie
Adobe PDF(1556Kb)  |  收藏  |  浏览/下载:171/64  |  提交时间:2023/06/26
基于自适应动态规划的最优跟踪控制方法研究 学位论文
, 2023
作者:  王鑫
Adobe PDF(6647Kb)  |  收藏  |  浏览/下载:167/10  |  提交时间:2023/06/08
自适应动态规划  输出调节  追逃博弈  最优控制  一致性控制