CASIA OpenIR

Browse/Search results: 15 items total, showing items 1-10

FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game (Journal article)
IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, Pages: 1-13
Authors:  Guangzheng Hu;  Yuanheng Zhu;  Haoran Li;  Dongbin Zhao
Adobe PDF(2144Kb)  |  Views/Downloads: 37/7  |  Submitted: 2024/06/05
Games  Q-learning  Task analysis  Optimization  Convergence  Training  Nash equilibrium  Multi-agent reinforcement learning  minimax-Q learning  two-team zero-sum Markov games  
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 13
Authors:  Hu, Guangzheng;  Zhu, Yuanheng;  Zhao, Dongbin;  Zhao, Mengchen;  Hao, Jianye
Adobe PDF(4187Kb)  |  Views/Downloads: 260/12  |  Submitted: 2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)  
BNAS: Efficient Neural Architecture Search Using Broad Scalable Architecture (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, Issue: 0, Pages: 0
Authors:  Ding ZX(丁子祥);  Yaran, Chen;  Nannan, Li;  Dongbin, Zhao;  Zhiquan, Sun;  C. L. Philip Chen
Adobe PDF(2713Kb)  |  Views/Downloads: 199/47  |  Submitted: 2022/01/06
Broad convolutional neural network (BCNN), image classification, neural architecture search (NAS), reinforcement learning (RL)  
Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target (Journal article)
COMPLEX & INTELLIGENT SYSTEMS, 2021, Pages: 12
Authors:  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(1431Kb)  |  Views/Downloads: 317/60  |  Submitted: 2021/12/28
Reinforcement learning  Missile guidance  Auxiliary learning  Self-imitation learning  
Heuristic rank selection with progressively searching tensor ring network (Journal article)
COMPLEX & INTELLIGENT SYSTEMS, 2021, Pages: 15
Authors:  Li, Nannan;  Pan, Yu;  Chen, Yaran;  Ding, Zixiang;  Zhao, Dongbin;  Xu, Zenglin
Adobe PDF(1305Kb)  |  Views/Downloads: 309/56  |  Submitted: 2021/04/27
Tensor ring networks  Rank selection  Progressively search  Image classification  
Comparison of 3D Object Detection Based on LiDAR Point Cloud (Conference paper)
Dali, China, 2019-5-24
Authors:  Li, Haoran;  Zhou, Xiaolei;  Chen, Yaran;  Zhang, Qichao;  Zhao, Dongbin;  Qian, Dianwei
Adobe PDF(296Kb)  |  Views/Downloads: 240/99  |  Submitted: 2020/09/02
Data-Based Reinforcement Learning for Nonzero-Sum Games With Unknown Drift Dynamics (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2019, Volume: 49, Issue: 8, Pages: 2874-2885
Authors:  Zhang, Qichao;  Zhao, Dongbin
Adobe PDF(1021Kb)  |  Views/Downloads: 455/135  |  Submitted: 2019/07/12
Integral reinforcement learning (IRL)  neural network (NN)  nonzero-sum (NZS) games  off-policy  single-critic  unknown drift dynamics  
Off-Policy Reinforcement Learning for Partially Unknown Nonzero-Sum Games (Conference paper)
Guangzhou, China, November 14–18
Authors:  Zhang, Qichao;  Zhao, Dongbin;  Zhang, Sibo
Adobe PDF(119Kb)  |  Views/Downloads: 275/105  |  Submitted: 2017/12/28
Comprehensive comparison of online ADP algorithms for continuous-time optimal control (Journal article)
ARTIFICIAL INTELLIGENCE REVIEW, 2018, Volume: 49, Issue: 4, Pages: 531-547
Authors:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(766Kb)  |  Views/Downloads: 429/189  |  Submitted: 2017/09/13
Adaptive Dynamic Programming  Policy Iteration  Integral Reinforcement Learning  Experience Replay  Off-policy  
A Semi-Supervised Predictive Sparse Decomposition Based on Task-Driven Dictionary Learning (Journal article)
COGNITIVE COMPUTATION, 2017, Volume: 9, Issue: 1, Pages: 115-124
Authors:  Lv Le;  Zhao Dongbin;  Deng QingQiong
Adobe PDF(998Kb)  |  Views/Downloads: 422/141  |  Submitted: 2017/05/08
Semi-supervised Learning  Predictive Sparse Decomposition  Neural Networks  Dictionary Learning