CASIA OpenIR

浏览/检索结果: 共15条,第1-10条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
Boosting On-Policy Actor-Critic With Shallow Updates in Critic 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 页码: 10
作者:  Li, Luntong;  Zhu, Yuanheng
收藏  |  浏览/下载:3/0  |  提交时间:2024/07/03
Artificial neural networks  Vectors  Task analysis  Training  Representation learning  Approximation algorithms  Optimization  Actor-critic  deep reinforcement learning (DRL)  proximal policy optimization (PPO)  shallow reinforcement learning (SRL)  
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:  Chai, Jiajun;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2469Kb)  |  收藏  |  浏览/下载:61/3  |  提交时间:2023/11/16
Large-scale multiagent  neighboring communication  reinforcement learning (RL)  variational information flow  
Theme-Aware Aesthetic Distribution Prediction With Full-Resolution Photographs 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 页码: 15
作者:  Jia, Gengyun;  Li, Peipei;  He, Ran
Adobe PDF(10845Kb)  |  收藏  |  浏览/下载:281/42  |  提交时间:2022/06/06
Aesthetic quality assessment (AQA)  full resolution  region of image (RoM) pooling  theme  
Attention Enhanced Reinforcement Learning for Multi agent Cooperation 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 页码: 15
作者:  Pu, Zhiqiang;  Wang, Huimu;  Liu, Zhen;  Yi, Jianqiang;  Wu, Shiguang
Adobe PDF(2967Kb)  |  收藏  |  浏览/下载:349/52  |  提交时间:2022/06/06
Training  Reinforcement learning  Games  Scalability  Task analysis  Standards  Optimization  Attention mechanism  deep reinforcement learning (DRL)  graph convolutional networks  multi agent systems  
Model-Free Adaptive Optimal Control for Unknown Nonlinear Multiplayer Nonzero-Sum Game 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 2, 页码: 879-892
作者:  Wei, Qinglai;  Zhu, Liao;  Song, Ruizhuo;  Zhang, Pinjia;  Liu, Derong;  Xiao, Jun
收藏  |  浏览/下载:276/0  |  提交时间:2022/03/17
Heuristic algorithms  Nonlinear systems  Optimal control  Mathematical model  Dynamic programming  Games  Adaptive systems  Adaptive dynamic programming (ADP)  globalized dual-heuristic dynamic programming (GDHP)  multiplayer nonzero-sum game (MP-NZSG)  neural network (NN)  
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 13
作者:  Hu, Guangzheng;  Zhu, Yuanheng;  Zhao, Dongbin;  Zhao, Mengchen;  Hao, Jianye
Adobe PDF(4187Kb)  |  收藏  |  浏览/下载:254/10  |  提交时间:2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)  
Spiking Adaptive Dynamic Programming Based on Poisson Process for Discrete-Time Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 11
作者:  Wei, Qinglai;  Han, Liyuan;  Zhang, Tielin
Adobe PDF(2904Kb)  |  收藏  |  浏览/下载:227/11  |  提交时间:2022/01/27
Maximum likelihood estimation (MLE)  Nonlinear systems  Optimal control  Poisson process  Spike train  Spiking Adaptive dynamic programming(SADP)  
Neuro-Optimal Trajectory Tracking With Value Iteration of Discrete-Time Nonlinear Dynamics 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 12
作者:  Wang, Ding;  Ha, Mingming;  Cheng, Long
收藏  |  浏览/下载:278/0  |  提交时间:2022/01/27
Trajectory  Heuristic algorithms  Convergence  Trajectory tracking  Stability criteria  Optimal control  Dynamic programming  Adaptive critic design  discrete-time nonlinear plants  neuro-optimal trajectory tracking  uniformly ultimately bounded stability  value iteration  
Decentralized Event-Driven Constrained Control Using Adaptive Critic Designs 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 15
作者:  Yang, Xiong;  Zhu, Yuanheng;  Dong, Na;  Wei, Qinglai
Adobe PDF(1578Kb)  |  收藏  |  浏览/下载:229/12  |  提交时间:2022/01/27
Adaptive critic designs (ACDs)  adaptive dynamic programming (ADP)  decentralized event-driven control  input constraint  reinforcement learning (RL)  
Broad Learning System Based on Maximum Correntropy Criterion 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 卷号: 32, 期号: 7, 页码: 3083-3097
作者:  Zheng, Yunfei;  Chen, Badong;  Wang, Shiyuan;  Wang, Weiqun
收藏  |  浏览/下载:213/0  |  提交时间:2021/08/15
Learning systems  Robustness  Standards  Optimization  Training  Perturbation methods  Mean square error methods  Broad learning system (BLS)  incremental learning algorithms  maximum correntropy criterion (MCC)  regression and classification