CASIA OpenIR

Browse/search results: 12 items in total, showing items 1-10

Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning (journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 13
Authors: Hu, Guangzheng; Zhu, Yuanheng; Zhao, Dongbin; Zhao, Mengchen; Hao, Jianye
Views/Downloads: 216/0 | Submitted: 2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)  
EDP: An Efficient Decomposition and Pruning Scheme for Convolutional Neural Network Compression (journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, Volume: 32, Issue: 0, Pages: 0
Authors: Ruan, Xiaofeng; Liu, Yufan; Yuan, Chunfeng; Li, Bing; Hu, Weiming; Li, Yangxi; Maybank, Stephen
Adobe PDF (3625 KB) | Views/Downloads: 303/43 | Submitted: 2021/06/17
Data-driven  low-rank decomposition  model compression and acceleration  structured pruning  
Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning (journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, Volume: 31, Issue: 12, Pages: 5245-5256
Authors: Wei, Qinglai; Wang, Lingxiao; Liu, Yu; Polycarpou, Marios M.
Adobe PDF (4019 KB) | Views/Downloads: 344/78 | Submitted: 2021/03/08
Elevators  Optimal control  Backpropagation  Machine learning  Neural networks  Learning (artificial intelligence)  Actor-critic  adaptive dynamic programming  deep learning (DL)  elevator group control (EGC)  optimal control  reinforcement learning (RL)
Deep Reinforcement Learning-Based Automatic Exploration for Navigation in Unknown Environment (journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, Volume: 31, Issue: 6, Pages: 2064-2076
Authors: Li, Haoran; Zhang, Qichao; Zhao, Dongbin
Adobe PDF (4274 KB) | Views/Downloads: 380/117 | Submitted: 2020/08/03
Robot sensing systems  Navigation  Entropy  Neural networks  Task analysis  Planning  Automatic exploration  deep reinforcement learning (DRL)  optimal decision  partial observation  
Adaptive Constrained Optimal Control Design for Data-Based Nonlinear Discrete-Time Systems With Critic-Only Structure (journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, Volume: 29, Issue: 6, Pages: 2099-2111
Authors: Luo, Biao; Liu, Derong; Wu, Huai-Ning
Adobe PDF (1045 KB) | Views/Downloads: 383/117 | Submitted: 2018/10/10
Adaptive Control  Adaptive Dynamic Programming  Constraints  Critic-only  Data-based  Optimal Control  Q-learning  
Robust C-Loss Kernel Classifiers (journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, Volume: 29, Issue: 3, Pages: 510-522
Authors: Xu, Guibiao; Hu, Bao-Gang; Principe, Jose C.
Adobe PDF (3169 KB) | Views/Downloads: 395/157 | Submitted: 2018/01/05
Correntropy  Half-quadratic (HQ) Optimization  Kernel Classifier  Loss Function
Iterative Adaptive Dynamic Programming for Solving Unknown Nonlinear Zero-Sum Game Based on Online Data (journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, Volume: 28, Issue: 3, Pages: 714-725
Authors: Zhu, Yuanheng; Zhao, Dongbin; Li, Xiangjun
Adobe PDF (547 KB) | Views/Downloads: 448/186 | Submitted: 2017/05/05
Adaptive Dynamic Programming (ADP)  H-infinity Control  Policy Iteration (PI)  Zero-sum Game (ZSG)
A pdf-Free Change Detection Test Based on Density Difference Estimation (journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, Volume: 29, Issue: 2, Pages: 324-334
Authors: Bu, Li; Alippi, Cesare; Zhao, Dongbin
Adobe PDF (2468 KB) | Views/Downloads: 379/109 | Submitted: 2017/05/04
Concept Drift  Least Squares Density-difference (LSDD)-based Method  Probability Density Function (pdf)-free  Three-level Threshold Mechanism
Model-Free Optimal Tracking Control via Critic-Only Q-Learning (journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, Volume: 27, Issue: 10, Pages: 2134-2144
Authors: Luo, Biao; Liu, Derong; Huang, Tingwen; Wang, Ding
Adobe PDF (1521 KB) | Views/Downloads: 586/287 | Submitted: 2016/10/24
Critic-only Q-learning (CoQL)  Model-free  Nonaffine Nonlinear Systems  Optimal Tracking Control
The Twist Tensor Nuclear Norm for Video Completion (journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, Volume: 28, Issue: 12, Pages: 2961-2973
Authors: Hu, Wenrui; Tao, Dacheng; Zhang, Wensheng; Xie, Yuan; Yang, Yehui
Adobe PDF (24685 KB) | Views/Downloads: 485/144 | Submitted: 2016/10/22
Low-rank Tensor Estimation (LRTE)  Tensor Multirank  Tensor Nuclear Norm (TNN)  Twist Tensor  Video Completion