CASIA OpenIR

浏览/检索结果: 共8条,第1-8条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:  Chai, Jiajun;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2469Kb)  |  收藏  |  浏览/下载:62/3  |  提交时间:2023/11/16
Large-scale multiagent  neighboring communication  reinforcement learning (RL)  variational information flow  
Neural event-triggered optimal filtering co-design of Markovian jump systems with hidden mode detections 期刊论文
TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2023, 页码: 11
作者:  Ma, Chao;  Lu, Yanfeng;  Wu, Wei
Adobe PDF(1257Kb)  |  收藏  |  浏览/下载:151/16  |  提交时间:2023/03/20
Markovian jump system  neural event-triggered scheme  optimal filtering  unknown nonlinearity  hidden mode detections  
Decentralized Autonomous Operations and Organizations in TransVerse: Federated Intelligence for Smart Mobility 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 卷号: 53, 期号: 4, 页码: 2062-2072
作者:  Zhao, Chen;  Dai, Xingyuan;  Lv, Yisheng;  Niu, Jinglong;  Lin, Yilun
Adobe PDF(1921Kb)  |  收藏  |  浏览/下载:291/6  |  提交时间:2023/02/22
Intelligent Transportation Systems (ITS)  Artificial Systems, Computational Experiments, Parallel Execution (ACP)  Cyber–Physical–Social Systems (CPSS)  
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 13
作者:  Hu, Guangzheng;  Zhu, Yuanheng;  Zhao, Dongbin;  Zhao, Mengchen;  Hao, Jianye
Adobe PDF(4187Kb)  |  收藏  |  浏览/下载:264/12  |  提交时间:2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)  
Decentralized Event-Driven Constrained Control Using Adaptive Critic Designs 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 15
作者:  Yang, Xiong;  Zhu, Yuanheng;  Dong, Na;  Wei, Qinglai
Adobe PDF(1578Kb)  |  收藏  |  浏览/下载:239/15  |  提交时间:2022/01/27
Adaptive critic designs (ACDs)  adaptive dynamic programming (ADP)  decentralized event-driven control  input constraint  reinforcement learning (RL)  
Generalized Actor-Critic Learning Optimal Control in Smart Home Energy Management 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 卷号: 17, 期号: 10, 页码: 6614-6623
作者:  Wei, Qinglai;  Liao, Zehua;  Shi, Guang
Adobe PDF(1229Kb)  |  收藏  |  浏览/下载:290/38  |  提交时间:2021/11/02
Optimal control  Process control  Smart homes  Dynamic programming  Numerical models  Iterative methods  Informatics  Actor-critic learning  adaptive critic designs  adaptive dynamic programming (ADP)  approximate dynamic programming  energy management  optimal control  smart grid  
Continuous-Time Distributed Policy Iteration for Multicontroller Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2021, 卷号: 51, 期号: 5, 页码: 2372-2383
作者:  Wei, Qinglai;  Li, Hongyang;  Yang, Xiong;  He, Haibo
Adobe PDF(1246Kb)  |  收藏  |  浏览/下载:283/54  |  提交时间:2021/06/07
Optimal control  Nonlinear systems  Decentralized control  Mathematical model  Convergence  Multi-agent systems  Adaptive dynamic programming (ADP)  approximate dynamic programming  distributed policy iteration  nonlinear systems  optimal control  
Continuous-Time Time-Varying Policy Iteration 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 卷号: 50, 期号: 12, 页码: 4958-4971
作者:  Wei, Qinglai;  Liao, Zehua;  Yang, Zhanyu;  Li, Benkai;  Liu, Derong
Adobe PDF(3149Kb)  |  收藏  |  浏览/下载:302/56  |  提交时间:2021/03/02
Optimal control  Nonlinear systems  Time-varying systems  Mathematical model  Dynamic programming  Approximation algorithms  Iterative algorithms  Adaptive critic designs  adaptive dynamic programming (ADP)  neuro-dynamic programming  nonlinear systems  optimal control  policy iteration