CASIA OpenIR

浏览/检索结果: 共6条,第1-6条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:  Chai, Jiajun;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2469Kb)  |  收藏  |  浏览/下载:60/2  |  提交时间:2023/11/16
Large-scale multiagent  neighboring communication  reinforcement learning (RL)  variational information flow  
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  收藏  |  浏览/下载:236/6  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Boost 3-D Object Detection via Point Clouds Segmentation and Fused 3-D GIoU-L-1 Loss 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 2, 页码: 762-773
作者:  Chen, Yaran;  Li, Haoran;  Gao, Ruiyuan;  Zhao, Dongbin
Adobe PDF(2082Kb)  |  收藏  |  浏览/下载:284/65  |  提交时间:2022/03/17
3-D object detection  generalized Intersection of Union (GIoU) loss  segmentation  
BNAS: Efficient Neural Architecture Search Using Broad Scalable Architecture 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 期号: 0, 页码: 0
作者:  Ding ZX(丁子祥);  Yaran, Chen;  Nannan, Li;  Dingbin, Zhao;  Zhiquan, Sun;  C. L. Philip Chen
Adobe PDF(2713Kb)  |  收藏  |  浏览/下载:194/45  |  提交时间:2022/01/06
Broad convolutional neural network (BCNN), image classification, neural architecture search (NAS), reinforcement learning (RL)  
Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 12, 页码: 5245-5256
作者:  Wei, Qinglai;  Wang, Lingxiao;  Liu, Yu;  Polycarpou, Marios M.
Adobe PDF(4019Kb)  |  收藏  |  浏览/下载:371/84  |  提交时间:2021/03/08
Elevators  Optimal control  Backpropagation  Machine learning  Neural networks  Learning (artificial intelligence)  Actor  –critic  adaptive dynamic programming  deep learning (DL)  elevator group control (EGC)  optimal control  reinforcement learning (RL)  
Event-Based Robust Control for Uncertain Nonlinear Systems Using Adaptive Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 1, 页码: 37-50
作者:  Zhang, Qichao;  Zhao, Dongbin;  Wang, Ding
浏览  |  Adobe PDF(2233Kb)  |  收藏  |  浏览/下载:561/252  |  提交时间:2017/05/04
Adaptive Dynamic Programming (Adp)  Event-based Control  Neural Network (Nn)  Robust Control  Unmatched Uncertainties