CASIA OpenIR

浏览/检索结果: 共8条,第1-8条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
A Self-Attention-Based Deep Reinforcement Learning Approach for AGV Dispatching Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 页码: 12
作者:  Wei, Qinglai;  Yan, Yutian;  Zhang, Jie;  Xiao, Jun;  Wang, Cong
收藏  |  浏览/下载:217/0  |  提交时间:2023/01/09
Automated guided vehicle (AGV) dispatching  deep learning  reinforcement learning (RL)  self-attention  
VGN: Value Decomposition With Graph Attention Networks for Multiagent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 页码: 14
作者:  Wei, Qinglai;  Li, Yugu;  Zhang, Jie;  Wang, Fei-Yue
收藏  |  浏览/下载:221/0  |  提交时间:2022/07/25
Mathematical models  Task analysis  Games  Q-learning  Neural networks  Behavioral sciences  Training  Deep learning  graph attention networks (GATs)  multiagent systems  reinforcement learning  
Attention Enhanced Reinforcement Learning for Multi agent Cooperation 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 页码: 15
作者:  Pu, Zhiqiang;  Wang, Huimu;  Liu, Zhen;  Yi, Jianqiang;  Wu, Shiguang
Adobe PDF(2967Kb)  |  收藏  |  浏览/下载:316/45  |  提交时间:2022/06/06
Training  Reinforcement learning  Games  Scalability  Task analysis  Standards  Optimization  Attention mechanism  deep reinforcement learning (DRL)  graph convolutional networks  multi agent systems  
Spiking Adaptive Dynamic Programming Based on Poisson Process for Discrete-Time Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 11
作者:  Wei, Qinglai;  Han, Liyuan;  Zhang, Tielin
Adobe PDF(2904Kb)  |  收藏  |  浏览/下载:190/1  |  提交时间:2022/01/27
Maximum likelihood estimation (MLE)  Nonlinear systems  Optimal control  Poisson process  Spike train  Spiking Adaptive dynamic programming(SADP)  
Target Tracking Control of a Biomimetic Underwater Vehicle Through Deep Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 12
作者:  Wang, Yu;  Tang, Chong;  Wang, Shuo;  Cheng, Long;  Wang, Rui;  Tan, Min;  Hou, Zengguang
收藏  |  浏览/下载:216/0  |  提交时间:2022/01/27
Reinforcement learning  Target tracking  Robots  Sports  Aerospace electronics  Mobile robots  Underwater vehicles  Biomimetic underwater vehicle (BUV)  reinforcement learning  target tracking control  
Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 12, 页码: 5245-5256
作者:  Wei, Qinglai;  Wang, Lingxiao;  Liu, Yu;  Polycarpou, Marios M.
Adobe PDF(4019Kb)  |  收藏  |  浏览/下载:341/78  |  提交时间:2021/03/08
Elevators  Optimal control  Backpropagation  Machine learning  Neural networks  Learning (artificial intelligence)  Actor  –critic  adaptive dynamic programming  deep learning (DL)  elevator group control (EGC)  optimal control  reinforcement learning (RL)  
Dynamical Channel Pruning by Conditional Accuracy Change for Deep Neural Networks 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 无, 期号: 无, 页码: 无
作者:  Chen, Zhiqiang;  Xu, Ting-Bing;  Du, Changde;  Liu, Cheng-Lin;  He, Huiguang
浏览  |  Adobe PDF(4352Kb)  |  收藏  |  浏览/下载:282/64  |  提交时间:2021/01/27
Conditional accuracy change (CAC), direct criterion, dynamical channel pruning, neural network compression, structure shaping.  
Accelerating Minibatch Stochastic Gradient Descent Using Typicality Sampling 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 11, 页码: 4649-4659
作者:  Peng, Xinyu;  Li, Li;  Wang, Fei-Yue
收藏  |  浏览/下载:232/0  |  提交时间:2021/01/06
Training  Convergence  Approximation algorithms  Stochastic processes  Estimation  Optimization  Acceleration  Batch selection  machine learning  minibatch stochastic gradient descent (SGD)  speed of convergence