CASIA OpenIR

浏览/检索结果: 共9条,第1-9条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  收藏  |  浏览/下载:252/12  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Decentralized Event-Driven Constrained Control Using Adaptive Critic Designs 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 15
作者:  Yang, Xiong;  Zhu, Yuanheng;  Dong, Na;  Wei, Qinglai
Adobe PDF(1578Kb)  |  收藏  |  浏览/下载:239/15  |  提交时间:2022/01/27
Adaptive critic designs (ACDs)  adaptive dynamic programming (ADP)  decentralized event-driven control  input constraint  reinforcement learning (RL)  
Stacked BNAS: Rethinking Broad Convolutional Neural Network for Neural Architecture Search 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 0, 期号: 0, 页码: 0
作者:  Zixiang, Ding;  Yaran, Chen;  Nannan, Li;  Dongbin, Zhao;  C.L.Philip Chen,
Adobe PDF(764Kb)  |  收藏  |  浏览/下载:244/41  |  提交时间:2022/01/07
broad neural architecture search, stacked broad convolutional neural network, knowledge embedding search, image classification.  
Manifold Warp Segmentation of Human Action 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 5, 页码: 1414-1426
作者:  Liu, Shenglan;  Feng, Lin;  Liu, Yang;  Qiao, Hong;  Wu, Jun;  Wang, Wei
浏览  |  Adobe PDF(3667Kb)  |  收藏  |  浏览/下载:429/142  |  提交时间:2018/01/06
Curvature  Dimensionality Reduction  Human Action Segmentation  Space Alignment  
A pdf-Free Change Detection Test Based on Density Difference Estimation 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 2, 页码: 324-334
作者:  Bu, Li;  Alippi, Cesare;  Zhao, Dongbin
浏览  |  Adobe PDF(2468Kb)  |  收藏  |  浏览/下载:411/119  |  提交时间:2017/05/04
Concept Drift  Least Squares Density-difference (Lsdd)-based Method  Probability Density Function (Pdf)-free  Three-level Threshold Mechanism  
Infinite Horizon Self-Learning Optimal Control of Nonaffine Discrete-Time Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 4, 页码: 866-879
作者:  Wei, Qinglai;  Liu, Derong;  Yang, Xiong
浏览  |  Adobe PDF(2408Kb)  |  收藏  |  浏览/下载:311/125  |  提交时间:2015/09/21
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Generalized Policy Iteration  Neural Networks (Nns)  Neurodynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning  
MEC-A Near-Optimal Online Reinforcement Learning Algorithm for Continuous Deterministic Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 2, 页码: 346-356
作者:  Zhao, Dongbin;  Zhu, Yuanheng
浏览  |  Adobe PDF(2156Kb)  |  收藏  |  浏览/下载:297/116  |  提交时间:2015/09/18
Efficient Exploration  Probably Approximately Correct (Pac)  Reinforcement Learning (Rl)  State Aggregation  
Decentralized Stabilization for a Class of Continuous-Time Nonlinear Interconnected Systems Using Online Learning Optimal Control Approach 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 卷号: 25, 期号: 2, 页码: 418-428
作者:  Liu, Derong;  Wang, Ding;  Li, Hongliang
Adobe PDF(1311Kb)  |  收藏  |  浏览/下载:304/110  |  提交时间:2015/08/12
Adaptive Dynamic Programming  Decentralized Control  Large-scale Systems  Neural Networks  Nonlinear Interconnected Systems  Optimal Control  Policy Iteration  Reinforcement Learning  
What Are the Differences Between Bayesian Classifiers and Mutual-Information Classifiers? 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 卷号: 25, 期号: 2, 页码: 249-264
作者:  Hu, Bao-Gang
浏览  |  Adobe PDF(1314Kb)  |  收藏  |  浏览/下载:230/30  |  提交时间:2015/08/12
Abstaining Classifier  Bayes  Cost-sensitive Learning  Entropy  Error Types  Mutual Information  Reject Types