CASIA OpenIR

浏览/检索结果: 共38条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Attention Enhanced Reinforcement Learning for Multi agent Cooperation 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 页码: 15
作者:  Pu, Zhiqiang;  Wang, Huimu;  Liu, Zhen;  Yi, Jianqiang;  Wu, Shiguang
Adobe PDF(2967Kb)  |  收藏  |  浏览/下载:266/38  |  提交时间:2022/06/06
Training  Reinforcement learning  Games  Scalability  Task analysis  Standards  Optimization  Attention mechanism  deep reinforcement learning (DRL)  graph convolutional networks  multi agent systems  
Model-Free Adaptive Optimal Control for Unknown Nonlinear Multiplayer Nonzero-Sum Game 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 2, 页码: 879-892
作者:  Wei, Qinglai;  Zhu, Liao;  Song, Ruizhuo;  Zhang, Pinjia;  Liu, Derong;  Xiao, Jun
收藏  |  浏览/下载:210/0  |  提交时间:2022/03/17
Heuristic algorithms  Nonlinear systems  Optimal control  Mathematical model  Dynamic programming  Games  Adaptive systems  Adaptive dynamic programming (ADP)  globalized dual-heuristic dynamic programming (GDHP)  multiplayer nonzero-sum game (MP-NZSG)  neural network (NN)  
Stacked BNAS: Rethinking Broad Convolutional Neural Network for Neural Architecture Search 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 0, 期号: 0, 页码: 0
作者:  Zixiang, Ding;  Yaran, Chen;  Nannan, Li;  Dongbin, Zhao;  C.L.Philip Chen,
Adobe PDF(764Kb)  |  收藏  |  浏览/下载:176/28  |  提交时间:2022/01/07
broad neural architecture search, stacked broad convolutional neural network, knowledge embedding search, image classification.  
Neuro-Optimal Trajectory Tracking With Value Iteration of Discrete-Time Nonlinear Dynamics 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 12
作者:  Wang, Ding;  Ha, Mingming;  Cheng, Long
收藏  |  浏览/下载:216/0  |  提交时间:2022/01/27
Trajectory  Heuristic algorithms  Convergence  Trajectory tracking  Stability criteria  Optimal control  Dynamic programming  Adaptive critic design  discrete-time nonlinear plants  neuro-optimal trajectory tracking  uniformly ultimately bounded stability  value iteration  
Spiking Adaptive Dynamic Programming Based on Poisson Process for Discrete-Time Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 11
作者:  Wei, Qinglai;  Han, Liyuan;  Zhang, Tielin
收藏  |  浏览/下载:151/0  |  提交时间:2022/01/27
Process control  Control systems  Optimal control  Maximum likelihood estimation  Performance analysis  Dynamic programming  Nonlinear systems  Maximum likelihood estimation (MLE)  nonlinear systems  optimal control  Poisson process  spike train  spiking adaptive dynamic programming (SADP)  
Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 12, 页码: 5245-5256
作者:  Wei, Qinglai;  Wang, Lingxiao;  Liu, Yu;  Polycarpou, Marios M.
Adobe PDF(4019Kb)  |  收藏  |  浏览/下载:290/74  |  提交时间:2021/03/08
Elevators  Optimal control  Backpropagation  Machine learning  Neural networks  Learning (artificial intelligence)  Actor  –critic  adaptive dynamic programming  deep learning (DL)  elevator group control (EGC)  optimal control  reinforcement learning (RL)  
Reinforcement Learning-Based Optimal Stabilization for Unknown Nonlinear Systems Subject to Inputs With Uncertain Constraints 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 10, 页码: 4330-4340
作者:  Zhao, Bo;  Liu, Derong;  Luo, Chaomin
收藏  |  浏览/下载:182/0  |  提交时间:2021/01/07
Nonlinear systems  Optimal control  Artificial neural networks  Actuators  Observers  Feedforward systems  Adaptive dynamic programming (ADP)  neural networks (NNs)  optimal control  reinforcement learning (RL)  uncertain input constraints  unknown nonlinear systems  
Complex-Valued Discrete-Time Neural Dynamics for Perturbed Time-Dependent Complex Quadratic Programming With Applications 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 9, 页码: 3555-3569
作者:  Qi, Yimeng;  Jin, Long;  Wang, Yaonan;  Xiao, Lin;  Zhang, Jiliang
收藏  |  浏览/下载:169/0  |  提交时间:2021/01/07
Computational modeling  Convergence  Mathematical model  Recurrent neural networks  Perturbation methods  Robots  Numerical models  Complex domain  discrete-time neural dynamics (DTND)  quasi-Newton Broyden-Fletcher-Goldfarb-Shanno (BFGS)  quadratic programming (QP)  
Deep Reinforcement Learning-Based Automatic Exploration for Navigation in Unknown Environment 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 6, 页码: 2064-2076
作者:  Li, Haoran;  Zhang, Qichao;  Zhao, Dongbin
浏览  |  Adobe PDF(4274Kb)  |  收藏  |  浏览/下载:328/110  |  提交时间:2020/08/03
Robot sensing systems  Navigation  Entropy  Neural networks  Task analysis  Planning  Automatic exploration  deep reinforcement learning (DRL)  optimal decision  partial observation  
BNAS: Efficient Neural Architecture Search Using Broad Scalable Architecture 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 期号: 0, 页码: 0
作者:  Ding ZX(丁子祥);  Yaran, Chen;  Nannan, Li;  Dingbin, Zhao;  Zhiquan, Sun;  C. L. Philip Chen
Adobe PDF(2713Kb)  |  收藏  |  浏览/下载:146/39  |  提交时间:2022/01/06
Broad convolutional neural network (BCNN), image classification, neural architecture search (NAS), reinforcement learning (RL)