CASIA OpenIR

Browse/Search Results: 7 items in total, showing items 1-7

UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 12
Authors:  Chai, Jiajun;  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin;  Ma, Zhe;  Sun, Kewu;  Ding, Jishiyu
Adobe PDF(3402Kb)  |  Views/Downloads: 243/26  |  Submitted: 2022/01/27
Multi-agent systems  Training  Task analysis  Reinforcement learning  Sun  Learning systems  Semantics  Centralized training with decentralized execution (CTDE)  multiagent  reinforcement learning  StarCraft II  
Discrete-Time Stable Generalized Self-Learning Optimal Control With Approximation Errors (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, Volume: 29, Issue: 4, Pages: 1226-1238
Authors:  Wei, Qinglai;  Li, Benkai;  Song, Ruizhuo
Adobe PDF(2475Kb)  |  Views/Downloads: 388/125  |  Submitted: 2017/02/23
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Generalized Policy Iteration (Gpi)  Neural Networks  Neurodynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning  
In Defense of Locality-Sensitive Hashing (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, Volume: 29, Issue: 1, Pages: 87-103
Authors:  Ding, Kun;  Huo, Chunlei;  Fan, Bin;  Xiang, Shiming;  Pan, Chunhong;  Fan B(樊斌)
Adobe PDF(2975Kb)  |  Views/Downloads: 520/185  |  Submitted: 2016/10/24
Locality-sensitive Hashing (Lsh)  Semantic Similarity Search  Two-step Hashing  
Data-Driven Zero-Sum Neuro-Optimal Control for a Class of Continuous-Time Unknown Nonlinear Systems With Disturbance Using ADP (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, Volume: 27, Issue: 2, Pages: 444-458
Authors:  Wei, Qinglai;  Song, Ruizhuo;  Yan, Pengfei
Adobe PDF(2204Kb)  |  Views/Downloads: 414/137  |  Submitted: 2016/06/14
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Neurodynamic Programming  Nonlinear Systems  Optimal Control  Recurrent Neural Network (Rnn)  Reinforcement Learning  
Discriminative Least Squares Regression for Multiclass Classification and Feature Selection (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, Volume: 23, Issue: 11, Pages: 1738-1754
Authors:  Xiang, Shiming;  Nie, Feiping;  Meng, Gaofeng;  Pan, Chunhong;  Zhang, Changshui
Adobe PDF(725Kb)  |  Views/Downloads: 676/218  |  Submitted: 2015/09/18
Feature Selection  Least Squares Regression  Multiclass Classification  Sparse Learning  
Error Bounds of Adaptive Dynamic Programming Algorithms for Solving Undiscounted Optimal Control Problems (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, Volume: 26, Issue: 6, Pages: 1323-1334
Authors:  Liu, Derong;  Li, Hongliang;  Wang, Ding
Adobe PDF(1114Kb)  |  Views/Downloads: 292/88  |  Submitted: 2015/09/17
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Neural Networks  Neurodynamic Programming  Nonlinear Systems  Optimal Control  
What Are the Differences Between Bayesian Classifiers and Mutual-Information Classifiers? (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, Volume: 25, Issue: 2, Pages: 249-264
Authors:  Hu, Bao-Gang
Adobe PDF(1314Kb)  |  Views/Downloads: 205/26  |  Submitted: 2015/08/12
Abstaining Classifier  Bayes  Cost-sensitive Learning  Entropy  Error Types  Mutual Information  Reject Types