CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共28条,第1-10条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
MAT: Morphological Adaptive Transformer for Universal Morphology Policy Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 1-12
作者:  Boyu Li;  Haran Li;  Yuanheng Zhu;  Dongbin Zhao
Adobe PDF(9953Kb)  |  收藏  |  浏览/下载:31/10  |  提交时间:2024/06/05
Optimal Pedestrian Evacuation in Building with Consecutive Differential Dynamic Programming 会议论文
, Budapest, Hungary, 2019-7-14
作者:  Zhu YH(朱圆恒);  Haibo He;  Dongbin Zhao;  Zhongsheng Hou
Adobe PDF(679Kb)  |  收藏  |  浏览/下载:73/35  |  提交时间:2023/05/22
A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV Air-to-Air Combat 期刊论文
IEEE Transactions on Systems, Man and Cybernetics: Systems, 2023, 页码: DOI: 10.1109/TSMC.2023.3270444
作者:  Jiajun Chai;  Wenzhang Chen;  Yuanheng Zhu;  Zong-xin Yao,;  Dongbin Zhao
Adobe PDF(9249Kb)  |  收藏  |  浏览/下载:289/128  |  提交时间:2023/04/26
Vision-based control in the open racing car simulator with deep and reinforcement learning 期刊论文
Journal of Ambient Intelligence and Humanized Computing, 2019, 页码: doi={10.1007/s12652-019-01503-y}
作者:  Yuanheng Zhu;  Dongbin Zhao
Adobe PDF(2210Kb)  |  收藏  |  浏览/下载:64/17  |  提交时间:2023/04/26
Empirical Policy Optimization for n-Player Markov Games 期刊论文
IEEE Transactions on Cybernetics, 2022, 页码: doi={10.1109/TCYB.2022.3179775}
作者:  Yuanheng Zhu;  Weifan Li;  Mengchen Zhao;  Jianye Hao;  Dongbin Zhao
Adobe PDF(1739Kb)  |  收藏  |  浏览/下载:111/44  |  提交时间:2023/04/26
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  收藏  |  浏览/下载:250/12  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Invariant Adaptive Dynamic Programming for Discrete-Time Optimal Control 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 卷号: 50, 期号: 11, 页码: 3959-3971
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo
Adobe PDF(2079Kb)  |  收藏  |  浏览/下载:213/16  |  提交时间:2021/01/07
Optimal control  Discrete-time systems  Heuristic algorithms  Dynamic programming  Convergence  Artificial intelligence  Nonlinear systems  Adaptive dynamic programming  discrete-time systems  invariant admissibility  optimal control  policy iteration  sum of squares  
LMI-Based Synthesis of String-Stable Controller for Cooperative Adaptive Cruise Control 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 卷号: 21, 期号: 11, 页码: 4516-4525
作者:  Zhu, Yuanheng;  He, Haibo;  Zhao, Dongbin
Adobe PDF(1648Kb)  |  收藏  |  浏览/下载:187/20  |  提交时间:2021/01/06
Cooperative adaptive cruise control  string stability  time-delay system  H-infinity control  linear matrix inequality  
Control-Limited Adaptive Dynamic Programming for Multi-Battery Energy Storage Systems 期刊论文
IEEE TRANSACTIONS ON SMART GRID, 2019, 卷号: 10, 期号: 4, 页码: 4235-4244
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Li, Xiangjun;  Wang, Ding
Adobe PDF(973Kb)  |  收藏  |  浏览/下载:315/15  |  提交时间:2019/09/30
Microgrid  energy storage system  multi-battery management system  adaptive dynamic programming  control-limited optimization  
Policy Iteration for H infinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 卷号: 48, 期号: 2, 页码: 500-509
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Yang, Xiong;  Zhang, Qichao
Adobe PDF(892Kb)  |  收藏  |  浏览/下载:326/51  |  提交时间:2018/10/10
Adaptive Dynamic Programming (Adp)  h Infinity Optimal Control  Policy Iteration (Pi)  Polynomial Nonlinear Systems  Sum Of Squares (Sos)