CASIA OpenIR

浏览/检索结果: 共13条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game 期刊论文
IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, 页码: 1-13
作者:  Guangzheng Hu;  Yuanheng Zhu;  Haoran Li;  Dongbin Zhao
Adobe PDF(2144Kb)  |  收藏  |  浏览/下载:39/7  |  提交时间:2024/06/05
Games  Q-learning  Task analysis  Optimization  Convergence  Training  Nash equilibrium  Multi-agent reinforcement learning  minimax-Q learning  two-team zero-sum Markov games  
Multistep Look-Ahead Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems With Isoperimetric Constraints 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 卷号: 54, 期号: 3, 页码: 1414-1426
作者:  Li, Tao;  Wei, Qinglai;  Wang, Fei-Yue
Adobe PDF(784Kb)  |  收藏  |  浏览/下载:116/21  |  提交时间:2024/02/22
Performance analysis  Optimal control  Dynamic programming  Iterative algorithms  Upper bound  Measurement  Convergence  Adaptive dynamic programming (ADP)  isoperimetric constraints  nonlinear systems  optimal control  policy iteration  
A New Noise-Tolerant Dual-Neural-Network Scheme for Robust Kinematic Control of Robotic Arms With Unknown Models 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2022, 卷号: 9, 期号: 10, 页码: 1778-1791
作者:  Ning Tan;  Peng Yu;  Zhiyan Zhong;  Fenglei Ni
Adobe PDF(52574Kb)  |  收藏  |  浏览/下载:227/48  |  提交时间:2022/09/08
Dual zeroing neural networks (ZNN)  finite-time convergence  model-free  robot control  robustness analysis  
Convergence Analysis of a Self-Stabilizing Algorithm for Minor Component Analysis 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2020, 卷号: 7, 期号: 6, 页码: 1585-1592
作者:  Haidi Dong;  Yingbin Gao;  Gang Liu
浏览  |  Adobe PDF(1583Kb)  |  收藏  |  浏览/下载:255/52  |  提交时间:2021/03/11
Convergence analysis  deterministic discrete time (DDT)  dynamic characteristic  Möller algorithm  
Model-Free H-infinity Optimal Tracking Control of Constrained Nonlinear Systems via an Iterative Adaptive Learning Algorithm 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 卷号: 50, 期号: 11, 页码: 4097-4108
作者:  Hou, Jiaxu;  Wang, Ding;  Liu, Derong;  Zhang, Yun
收藏  |  浏览/下载:233/0  |  提交时间:2021/01/07
Adaptive dynamic programming (ADP)  control constraints  convergence analysis  H-infinity tracking  neural network (NN)  optimal control  
The Strength of Nesterov's Extrapolation in the Individual Convergence of Nonsmooth Optimization 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 7, 页码: 2557-2568
作者:  Tao, Wei;  Pan, Zhisong;  Wu, Gaowei;  Tao, Qing
收藏  |  浏览/下载:254/0  |  提交时间:2020/08/03
Convergence  Extrapolation  Optimization  Acceleration  Machine learning  Task analysis  Machine learning algorithms  Individual convergence  machine learning  Nesterov's extrapolation  nonsmooth optimization  sparsity  
A Primal Neural Network for Online Equality-Constrained Quadratic Programming 期刊论文
COGNITIVE COMPUTATION, 2018, 卷号: 10, 期号: 2, 页码: 381-388
作者:  Chen, Ke;  Zhang, Zhaoxiang
收藏  |  浏览/下载:202/0  |  提交时间:2018/02/05
Recurrent Neural Networks  Online Equality-constrained Quadratic Programming  Global Exponential Convergence  Robustness Analysis  
An Improved Recurrent Network for Online Equality-Constrained Quadratic Programming 会议论文
BICS 2016, Beijing, China, 28-30 November 2016
作者:  Ke Chen;  Zhaoxiang Zhang
收藏  |  浏览/下载:224/0  |  提交时间:2017/02/09
Recurrent Networks  Online Equality-constrained Quadratic Programming  Global Exponential Convergence  Robustness Analysis  
A comparative study of two modeling approaches in neural networks 期刊论文
NEURAL NETWORKS, 2004, 卷号: 17, 期号: 1, 页码: 73-85
作者:  Xu, ZB;  Qiao, H;  Peng, JG;  Zhang, B
收藏  |  浏览/下载:144/0  |  提交时间:2016/12/05
Static Neural Network Modeling  Local Field Neural Network Modeling  Recurrent Neural Networks  Stability Analysis  Asymptotic Stability  Exponential Stability  Global Convergence  Globally Attractive  
beta-invariant measures for transition matrices of GI/M/1 type 期刊论文
STOCHASTIC MODELS, 2003, 卷号: 19, 期号: 2, 页码: 201-233
作者:  Li, QL;  Zhao, YQ
收藏  |  浏览/下载:121/0  |  提交时间:2015/11/08
Beta-invariant Measures  Quasi-stationary Distributions  Markov Chains Of Gi/m/1 Type  Radius Of Convergence  The Rg-factorizations  Spectral Analysis