CASIA OpenIR

浏览/检索结果: 共9条,第1-9条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  收藏  |  浏览/下载:251/12  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Theme-Aware Aesthetic Distribution Prediction With Full-Resolution Photographs 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 页码: 15
作者:  Jia, Gengyun;  Li, Peipei;  He, Ran
Adobe PDF(10845Kb)  |  收藏  |  浏览/下载:290/46  |  提交时间:2022/06/06
Aesthetic quality assessment (AQA)  full resolution  region of image (RoM) pooling  theme  
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 13
作者:  Hu, Guangzheng;  Zhu, Yuanheng;  Zhao, Dongbin;  Zhao, Mengchen;  Hao, Jianye
Adobe PDF(4187Kb)  |  收藏  |  浏览/下载:264/12  |  提交时间:2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)  
Deep Reinforcement Learning-Based Automatic Exploration for Navigation in Unknown Environment 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 6, 页码: 2064-2076
作者:  Li, Haoran;  Zhang, Qichao;  Zhao, Dongbin
浏览  |  Adobe PDF(4274Kb)  |  收藏  |  浏览/下载:412/126  |  提交时间:2020/08/03
Robot sensing systems  Navigation  Entropy  Neural networks  Task analysis  Planning  Automatic exploration  deep reinforcement learning (DRL)  optimal decision  partial observation  
Adaptive Constrained Optimal Control Design for Data-Based Nonlinear Discrete-Time Systems With Critic-Only Structure 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 6, 页码: 2099-2111
作者:  Luo, Biao;  Liu, Derong;  Wu, Huai-Ning
浏览  |  Adobe PDF(1045Kb)  |  收藏  |  浏览/下载:401/122  |  提交时间:2018/10/10
Adaptive Control  Adaptive Dynamic Programming  Constraints  Critic-only  Data-based  Optimal Control  Q-learning  
LMI Conditions for Global Stability of Fractional-Order Neural Networks 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 卷号: 28, 期号: 10, 页码: 2423-2433
作者:  Zhang, Shuo;  Yu, Yongguang;  Yu, Junzhi
浏览  |  Adobe PDF(3938Kb)  |  收藏  |  浏览/下载:539/191  |  提交时间:2017/01/23
Fractional Order  Generalized Projective Synchronization (Gps)  Linear Matrix Inequality (Lmi)  Neural Networks  Stability  
CPG Network Optimization for a Biomimetic Robotic Fish via PSO 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 卷号: 27, 期号: 9, 页码: 1962-1968
作者:  Yu, Junzhi;  Wu, Zhengxing;  Wang, Ming;  Tan, Min
浏览  |  Adobe PDF(1219Kb)  |  收藏  |  浏览/下载:366/141  |  提交时间:2016/12/26
Central Pattern Generator (Cpg)  Neurodynamic Systems  Parameter Optimization  Robotic Fish  
The Twist Tensor Nuclear Norm for Video Completion 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 卷号: 28, 期号: 12, 页码: 2961-2973
作者:  Hu, Wenrui;  Tao, Dacheng;  Zhang, Wensheng;  Xie, Yuan;  Yang, Yehui;  Wensheng Zhang
浏览  |  Adobe PDF(24685Kb)  |  收藏  |  浏览/下载:516/149  |  提交时间:2016/10/22
Low-rank Tensor Estimation (Lrte)  Tensor Multirank  Tensor Nuclear Norm (Tnn)  Twist Tensor  Video Completion  
Adaptive Optimal Control of Highly Dissipative Nonlinear Spatially Distributed Processes With Neuro-Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 4, 页码: 684-696
作者:  Luo, Biao;  Wu, Huai-Ning;  Li, Han-Xiong
浏览  |  Adobe PDF(2465Kb)  |  收藏  |  浏览/下载:377/123  |  提交时间:2016/03/30
Adaptive Optimal Control  Empirical Eigenfunction (Eef)  Highly Dissipative Partial Differential Equations (Pdes)  Neuro-dynamic Programming (Ndp)  Spatially Distributed Processes (Sdps)