CASIA OpenIR

Browse/Search results: 24 items in total, showing items 1-10

Multistep Look-Ahead Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems With Isoperimetric Constraints (journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, Volume: 54, Issue: 3, Pages: 1414-1426
Authors:  Li, Tao;  Wei, Qinglai;  Wang, Fei-Yue
Adobe PDF (784 KB)  |  Views/Downloads: 77/6  |  Submitted: 2024/02/22
Performance analysis  Optimal control  Dynamic programming  Iterative algorithms  Upper bound  Measurement  Convergence  Adaptive dynamic programming (ADP)  isoperimetric constraints  nonlinear systems  optimal control  policy iteration  
Continuous-Time Stochastic Policy Iteration of Adaptive Dynamic Programming (journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, Pages: 13
Authors:  Wei, Qinglai;  Zhou, Tianmin;  Lu, Jingwei;  Liu, Yu;  Su, Shuai;  Xiao, Jun
Views/Downloads: 135/0  |  Submitted: 2023/11/17
Adaptive dynamic programming (ADP)  Hamilton-Jacobi-Bellman equation (HJBE)  nonlinear stochastic system  stochastic policy iteration (PI)  
Meta Graph Transformer: A Novel Framework for Spatial-Temporal Traffic Prediction (journal article)
NEUROCOMPUTING, 2022, Volume: 491, Pages: 544-563
Authors:  Ye, Xue;  Fang, Shen;  Sun, Fang;  Zhang, Chunxia;  Xiang, Shiming
Adobe PDF (3491 KB)  |  Views/Downloads: 246/32  |  Submitted: 2022/09/19
Traffic prediction  Spatial-temporal modeling  Meta-learning  Attention mechanism  Deep learning  
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games (journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Volume: 33, Issue: 3, Pages: 1228-1241
Authors:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF (2838 KB)  |  Views/Downloads: 220/2  |  Submitted: 2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Generalization on Unseen Domains via Model-Agnostic Learning for Intelligent Fault Diagnosis (journal article)
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, Volume: 71, Pages: 11
Authors:  Wang, Huanjie;  Bai, Xiwei;  Wang, Sihan;  Tan, Jie;  Liu, Chengbao
Adobe PDF (3477 KB)  |  Views/Downloads: 286/52  |  Submitted: 2022/06/06
Data-driven fault diagnosis  Domain generalization  Model-agnostic learning  Rolling bearing  
Attention Enhanced Reinforcement Learning for Multi agent Cooperation (journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Pages: 15
Authors:  Pu, Zhiqiang;  Wang, Huimu;  Liu, Zhen;  Yi, Jianqiang;  Wu, Shiguang
Adobe PDF (2967 KB)  |  Views/Downloads: 331/48  |  Submitted: 2022/06/06
Training  Reinforcement learning  Games  Scalability  Task analysis  Standards  Optimization  Attention mechanism  deep reinforcement learning (DRL)  graph convolutional networks  multi agent systems  
Inductive Spatiotemporal Graph Convolutional Networks for Short-term Quantitative Precipitation Forecasting (journal article)
IEEE Transactions on Geoscience and Remote Sensing, 2022, Volume: 0, Issue: 0, Pages: 0
Authors:  Yajing, Wu;  Xuebing, Yang;  Yongqiang, Tang;  Chenyang, Zhang;  Guoping, Zhang;  Wensheng, Zhang
Adobe PDF (10052 KB)  |  Views/Downloads: 309/72  |  Submitted: 2022/04/06
Quantitative precipitation forecasting  graph convolutional networks (GCN)  spatiotemporal model  radar-rain gauge data merging  
Question-Guided Erasing-Based Spatiotemporal Attention Learning for Video Question Answering (journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 0
Authors:  Liu, Fei;  Liu, Jing;  Hong, Richang;  Lu, Hanqing
Adobe PDF (3550 KB)  |  Views/Downloads: 342/86  |  Submitted: 2022/01/27
video question answering  attention mechanism  metric learning  
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning (journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 13
Authors:  Hu, Guangzheng;  Zhu, Yuanheng;  Zhao, Dongbin;  Zhao, Mengchen;  Hao, Jianye
Adobe PDF (4187 KB)  |  Views/Downloads: 229/4  |  Submitted: 2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)  
Spiking Adaptive Dynamic Programming Based on Poisson Process for Discrete-Time Nonlinear Systems (journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 11
Authors:  Wei, Qinglai;  Han, Liyuan;  Zhang, Tielin
Adobe PDF (2904 KB)  |  Views/Downloads: 204/4  |  Submitted: 2022/01/27
Maximum likelihood estimation (MLE)  Nonlinear systems  Optimal control  Poisson process  Spike train  Spiking adaptive dynamic programming (SADP)