CASIA OpenIR

Browse/Search Results: 6 items in total, showing items 1-6

Attention Enhanced Reinforcement Learning for Multi-agent Cooperation (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Pages: 15
Authors: Pu, Zhiqiang; Wang, Huimu; Liu, Zhen; Yi, Jianqiang; Wu, Shiguang
Adobe PDF (2967 Kb)  |  Views/Downloads: 302/41  |  Submitted: 2022/06/06
Training  Reinforcement learning  Games  Scalability  Task analysis  Standards  Optimization  Attention mechanism  deep reinforcement learning (DRL)  graph convolutional networks  multi-agent systems
You Only Search Once: Single Shot Neural Architecture Search via Direct Sparse Optimization (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, Volume: 43, Issue: 9, Pages: 2891-2904
Authors: Zhang, Xinbang; Huang, Zehao; Wang, Naiyan; Xiang, Shiming; Pan, Chunhong
Adobe PDF (1271 Kb)  |  Views/Downloads: 276/54  |  Submitted: 2021/11/02
Computer architecture  Optimization  Learning (artificial intelligence)  Task analysis  Acceleration  Evolutionary computation  Convolution  Neural architecture search (NAS)  convolution neural network  sparse optimization
Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, Volume: 31, Issue: 12, Pages: 5245-5256
Authors: Wei, Qinglai; Wang, Lingxiao; Liu, Yu; Polycarpou, Marios M.
Adobe PDF (4019 Kb)  |  Views/Downloads: 324/75  |  Submitted: 2021/03/08
Elevators  Optimal control  Backpropagation  Machine learning  Neural networks  Learning (artificial intelligence)  Actor–critic  adaptive dynamic programming  deep learning (DL)  elevator group control (EGC)  optimal control  reinforcement learning (RL)
Engagement Enhancement Based on Human-in-the-Loop Optimization for Neural Rehabilitation (Journal article)
FRONTIERS IN NEUROROBOTICS, 2020, Volume: 12, Issue: none, Pages: 11
Authors: Wang, Jiaxing; Wang, Weiqun; Ren, Shixin; Shi, Weiguo; Hou, Zeng-Guang
Adobe PDF (2370 Kb)  |  Views/Downloads: 288/67  |  Submitted: 2021/01/06
human-in-the-loop optimization  EEG based neural engagement  sEMG based muscle activation  tracking accuracy  neural rehabilitation  
A Pareto optimal mechanism for demand-side platforms in real time bidding advertising markets (Journal article)
INFORMATION SCIENCES, 2018, Volume: 469, Pages: 119-140
Authors: Qin, Rui; Yuan, Yong; Wang, Fei-Yue
Adobe PDF (1804 Kb)  |  Views/Downloads: 499/159  |  Submitted: 2018/09/20
Computational advertising  Real time bidding  Demand side platform  Pareto optimal  Mechanism design  Computational experiment  
A Gradient-Based Reinforcement Learning Algorithm for Multiple Cooperative Agents (Journal article)
IEEE ACCESS, 2018, Volume: 6, Pages: 70223-70235
Authors: Zhang, Zhen; Wang, Dongqing; Zhao, Dongbin; Han, Qiaoni; Song, Tingting
Views/Downloads: 242/0  |  Submitted: 2019/07/12
Multi-agent reinforcement learning  gradient ascent  Q-learning  cooperative tasks