CASIA OpenIR

浏览/检索结果: 共6条,第1-6条 帮助

限定条件                            
已选(0)清除 条数/页:   排序方式:
Rethinking Pretraining as a Bridge From ANNs to SNNs 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 页码: 14
作者:  Lin, Yihan;  Hu, Yifan;  Ma, Shijie;  Yu, Dongjie;  Li, Guoqi
收藏  |  浏览/下载:283/0  |  提交时间:2023/03/20
Training  Task analysis  Neurons  Pipelines  Artificial neural networks  Feature extraction  Transfer learning  Event-driven dataset  neural network (NN) analysis  pretraining technique  spiking NN (SNN)  transfer learning  
A Self-Attention-Based Deep Reinforcement Learning Approach for AGV Dispatching Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 页码: 12
作者:  Wei, Qinglai;  Yan, Yutian;  Zhang, Jie;  Xiao, Jun;  Wang, Cong
收藏  |  浏览/下载:230/0  |  提交时间:2023/01/09
Automated guided vehicle (AGV) dispatching  deep learning  reinforcement learning (RL)  self-attention  
VGN: Value Decomposition With Graph Attention Networks for Multiagent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 页码: 14
作者:  Wei, Qinglai;  Li, Yugu;  Zhang, Jie;  Wang, Fei-Yue
收藏  |  浏览/下载:232/0  |  提交时间:2022/07/25
Mathematical models  Task analysis  Games  Q-learning  Neural networks  Behavioral sciences  Training  Deep learning  graph attention networks (GATs)  multiagent systems  reinforcement learning  
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  收藏  |  浏览/下载:223/3  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Towards Better Generalization of Deep Neural Networks via Non-Typicality Sampling Scheme 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 页码: 11
作者:  Peng, Xinyu;  Wang, Fei-Yue;  Li, Li
收藏  |  浏览/下载:185/0  |  提交时间:2022/06/06
Training  Estimation  Deep learning  Standards  Optimization  Noise measurement  Convergence  Deep learning  generalization performance  nontypicality sampling scheme  stochastic gradient descent (SGD)  
Model-Free Adaptive Optimal Control for Unknown Nonlinear Multiplayer Nonzero-Sum Game 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 2, 页码: 879-892
作者:  Wei, Qinglai;  Zhu, Liao;  Song, Ruizhuo;  Zhang, Pinjia;  Liu, Derong;  Xiao, Jun
收藏  |  浏览/下载:262/0  |  提交时间:2022/03/17
Heuristic algorithms  Nonlinear systems  Optimal control  Mathematical model  Dynamic programming  Games  Adaptive systems  Adaptive dynamic programming (ADP)  globalized dual-heuristic dynamic programming (GDHP)  multiplayer nonzero-sum game (MP-NZSG)  neural network (NN)