CASIA OpenIR

Browse/Search results: 4 items in total, showing items 1-4

A Self-Attention-Based Deep Reinforcement Learning Approach for AGV Dispatching Systems (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Pages: 12
Authors:  Wei, Qinglai;  Yan, Yutian;  Zhang, Jie;  Xiao, Jun;  Wang, Cong
Views/Downloads: 215/0  |  Submitted: 2023/01/09
Automated guided vehicle (AGV) dispatching  deep learning  reinforcement learning (RL)  self-attention  
VGN: Value Decomposition With Graph Attention Networks for Multiagent Reinforcement Learning (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Pages: 14
Authors:  Wei, Qinglai;  Li, Yugu;  Zhang, Jie;  Wang, Fei-Yue
Views/Downloads: 219/0  |  Submitted: 2022/07/25
Mathematical models  Task analysis  Games  Q-learning  Neural networks  Behavioral sciences  Training  Deep learning  graph attention networks (GATs)  multiagent systems  reinforcement learning  
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Volume: 33, Issue: 3, Pages: 1228-1241
Authors:  Zhu, Yuanheng;  Zhao, Dongbin
Views/Downloads: 207/0  |  Submitted: 2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Towards Better Generalization of Deep Neural Networks via Non-Typicality Sampling Scheme (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Pages: 11
Authors:  Peng, Xinyu;  Wang, Fei-Yue;  Li, Li
Views/Downloads: 172/0  |  Submitted: 2022/06/06
Training  Estimation  Deep learning  Standards  Optimization  Noise measurement  Convergence  Deep learning  generalization performance  nontypicality sampling scheme  stochastic gradient descent (SGD)