CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共4条,第1-4条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
Benchmarking lane-changing decision-making for deep reinforcement learning 会议论文
, Guangzhou, China, 2021-11
作者:  Wang JJ(王俊杰);  Zhang QC(张启超);  Zhao DB(赵冬斌)
Adobe PDF(1117Kb)  |  收藏  |  浏览/下载:157/54  |  提交时间:2023/05/30
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 13
作者:  Hu, Guangzheng;  Zhu, Yuanheng;  Zhao, Dongbin;  Zhao, Mengchen;  Hao, Jianye
Adobe PDF(4187Kb)  |  收藏  |  浏览/下载:261/12  |  提交时间:2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)  
Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target 期刊论文
COMPLEX & INTELLIGENT SYSTEMS, 2021, 页码: 12
作者:  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(1431Kb)  |  收藏  |  浏览/下载:319/60  |  提交时间:2021/12/28
Reinforcement learning  Missile guidance  Auxiliary learning  Self-imitation learning  
Optimal Feedback Control of Pedestrian Flow in Heterogeneous Corridors 期刊论文
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2021, 卷号: 18, 期号: 3, 页码: 1097-1108
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo
Adobe PDF(2666Kb)  |  收藏  |  浏览/下载:234/16  |  提交时间:2021/08/15
Microscopy  Feedback control  Mathematical model  Data models  Dynamic programming  Psychology  Computational modeling  Adaptive dynamic programming (ADP)  heterogeneous corridors  macroscopic pedestrian dynamics  optimal feedback control  pedestrian flow