CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共25条,第1-10条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:  Chai, Jiajun;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2469Kb)  |  收藏  |  浏览/下载:64/4  |  提交时间:2023/11/16
Large-scale multiagent  neighboring communication  reinforcement learning (RL)  variational information flow  
Dynamic-horizon model-based value estimation with latent imagination 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2022, 页码: 1-14
作者:  Wang JJ(王俊杰);  Zhang QC(张启超);  Zhao DB(赵冬斌)
Adobe PDF(2305Kb)  |  收藏  |  浏览/下载:199/69  |  提交时间:2023/05/30
Latent world model  model-based value expansion (MVE)  reinforcement learning  reinforcement learning  
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  收藏  |  浏览/下载:257/14  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Stacked BNAS: Rethinking Broad Convolutional Neural Network for Neural Architecture Search 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 0, 期号: 0, 页码: 0
作者:  Zixiang, Ding;  Yaran, Chen;  Nannan, Li;  Dongbin, Zhao;  C.L.Philip Chen,
Adobe PDF(764Kb)  |  收藏  |  浏览/下载:248/43  |  提交时间:2022/01/07
broad neural architecture search, stacked broad convolutional neural network, knowledge embedding search, image classification.  
BNAS: Efficient Neural Architecture Search Using Broad Scalable Architecture 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 期号: 0, 页码: 0
作者:  Ding ZX(丁子祥);  Yaran, Chen;  Nannan, Li;  Dingbin, Zhao;  Zhiquan, Sun;  C. L. Philip Chen
Adobe PDF(2713Kb)  |  收藏  |  浏览/下载:211/50  |  提交时间:2022/01/06
Broad convolutional neural network (BCNN), image classification, neural architecture search (NAS), reinforcement learning (RL)  
Adaptive Optimal Control of Heterogeneous CACC System With Uncertain Dynamics 期刊论文
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2019, 卷号: 27, 期号: 4, 页码: 1772-1779
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Zhong, Zhiguang
Adobe PDF(1189Kb)  |  收藏  |  浏览/下载:294/16  |  提交时间:2019/09/30
Adaptive optimal control  cooperative adaptive cruise control (CACC)  heterogeneous platoon  string stability  sum-of-squares polynomial  
Reinforcement Learning and Deep Learning based Lateral Control for Autonomous Driving 期刊论文
IEEE Computational Intelligence Magazine, IEEE Computational Intelligence Magazine, 2019, 2019, 卷号: 14, 14, 期号: 2, 页码: 83-98, 83-98
作者:  Dong Li;  Dongbin Zhao;  Qichao Zhang;  Yaran Chen
浏览  |  Adobe PDF(2205Kb)  |  收藏  |  浏览/下载:399/115  |  提交时间:2019/04/25
Deep Learning  Autonomous Driving  Visual Control  Reinforcement Learning  Deep Learning  Autonomous Driving  Visual Control  Reinforcement Learning  
StarCraft Micromanagement With Reinforcement Learning and Curriculum Transfer Learning 期刊论文
IEEE Transactions on Emerging Topics in Computational Intelligence, 2019, 卷号: 3, 期号: 1, 页码: 73-84
作者:  Kun Shao;  Yuanheng Zhu;  Dongbin Zhao
浏览  |  Adobe PDF(4125Kb)  |  收藏  |  浏览/下载:366/137  |  提交时间:2019/04/22
Reinforcement Learning, Transfer Learning, Curriculum Learning, Neural Network, Game Ai  
Deep Reinforcement Learning With Visual Attention for Vehicle Classification 期刊论文
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2017, 卷号: 9, 期号: 4, 页码: 356-367
作者:  Zhao, Dongbin;  Chen, Yaran;  Lv, Le
浏览  |  Adobe PDF(3192Kb)  |  收藏  |  浏览/下载:1073/547  |  提交时间:2017/05/08
Convolutional Neural Network (Cnn)  Reinforcement Learning  Vehicle Classification  Visual Attention  
Data-driven adaptive dynamic programming for continuous-time fully cooperative games with partially constrained inputs 期刊论文
NEUROCOMPUTING, 2017, 卷号: 238, 期号: *, 页码: 377-386
作者:  Zhang, Qichao;  Zhao, Dongbin;  Zhu, Yuanheng
浏览  |  Adobe PDF(1508Kb)  |  收藏  |  浏览/下载:671/284  |  提交时间:2017/05/04
Adaptive Dynamic Programming  Optimal Control  Neural Network  Fully Cooperative Games  Data-driven  Constrained Input