CASIA OpenIR

Browse/Search results: 4 items, showing 1-4

Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Volume: 33, Issue: 3, Pages: 1228-1241
Authors:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  Views/Downloads: 237/7  |  Submitted: 2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Learning Skill Characteristics From Manipulations (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Pages: 15
Authors:  Zhou, Xiao-Hu;  Xie, Xiao-Liang;  Liu, Shi-Qi;  Ni, Zhen-Liang;  Zhou, Yan-Jie;  Li, Rui-Qi;  Gui, Mei-Jiang;  Fan, Chen-Chen;  Feng, Zhen-Qiu;  Bian, Gui-Bin;  Hou, Zeng-Guang
Views/Downloads: 346/0  |  Submitted: 2022/06/10
Surgery  Sensors  In vivo  Task analysis  Arteries  Measurement  Sensor phenomena and characterization  Ensemble learning  in vivo porcine studies  percutaneous coronary intervention  skill characteristics  wavelet packet decomposition (WPD)  
PWSNAS: Powering Weight Sharing NAS With General Search Space Shrinking Framework (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Pages: 14
Authors:  Hu, Yiming;  Wang, Xingang;  Gu, Qingyi
Views/Downloads: 232/0  |  Submitted: 2022/06/10
Computer architecture  Training  Optimization  Extraterrestrial measurements  Estimation  Computational modeling  Search problems  Metric  neural architecture search (NAS)  search space shrinking  weight sharing  
Attention Enhanced Reinforcement Learning for Multi agent Cooperation (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Pages: 15
Authors:  Pu, Zhiqiang;  Wang, Huimu;  Liu, Zhen;  Yi, Jianqiang;  Wu, Shiguang
Adobe PDF(2967Kb)  |  Views/Downloads: 349/52  |  Submitted: 2022/06/06
Training  Reinforcement learning  Games  Scalability  Task analysis  Standards  Optimization  Attention mechanism  deep reinforcement learning (DRL)  graph convolutional networks  multi agent systems