CASIA OpenIR

浏览/检索结果: 共5条,第1-5条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Multi-UAV Cooperative Short-Range Combat via Attention-Based Reinforcement Learning using Individual Reward Shaping 会议论文
, Kyoto, Japan, October 23-27, 2022
作者:  Zhang TL(张天乐);  Qiu TH(丘腾海);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(896Kb)  |  收藏  |  浏览/下载:188/62  |  提交时间:2023/06/12
A Brain-Inspired Theory of Mind Spiking Neural Network for Reducing Safety Risks of Other Agents (vol 16, 753900, 2022) 期刊论文
FRONTIERS IN NEUROSCIENCE, 2022, 卷号: 16, 页码: 2
作者:  Zhao, Zhuoya;  Lu, Enmeng;  Zhao, Feifei;  Zeng, Yi;  Zhao, Yuxuan
Adobe PDF(4502Kb)  |  收藏  |  浏览/下载:179/15  |  提交时间:2022/07/25
brain-inspired model  safety risks  SNNs  R-STDP  theory of mind  
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  收藏  |  浏览/下载:258/15  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Supervised assisted deep reinforcement learning for emergency voltage control of power systems 期刊论文
NEUROCOMPUTING, 2022, 卷号: 475, 页码: 69-79
作者:  Li, Xiaoshuang;  Wang, Xiao;  Zheng, Xinhu;  Dai, Yuxin;  Yu, Zhihong;  Zhang, Jun Jason;  Bu, Guangquan;  Wang, Fei-Yue
Adobe PDF(2551Kb)  |  收藏  |  浏览/下载:364/78  |  提交时间:2022/06/06
Deep reinforcement learning  Behavioral cloning  Dynamic demonstration  Emergency control  
SADRL: Merging human experience with machine intelligence via supervised assisted deep reinforcement learning 期刊论文
NEUROCOMPUTING, 2022, 卷号: 467, 页码: 300-309
作者:  Li, Xiaoshuang;  Wang, Xiao;  Zheng, Xinhu;  Jin, Junchen;  Huang, Yanhao;  Zhang, Jun Jason;  Wang, Fei-Yue
Adobe PDF(1244Kb)  |  收藏  |  浏览/下载:351/79  |  提交时间:2021/12/28
Deep reinforcement learning  Behavioral cloning  Dynamic demonstration  Double DQN