CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共14条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Planning-Inspired Hierarchical Trajectory Prediction via Lateral-Longitudinal Decomposition for Autonomous Driving 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 卷号: 9, 期号: 1, 页码: 692-703
作者:  Li, Ding;  Zhang, Qichao;  Xia, Zhongpu;  Zheng, Yupeng;  Zhang, Kuan;  Yi, Menglong;  Jin, Wenda;  Zhao, Dongbin
收藏  |  浏览/下载:8/0  |  提交时间:2024/07/22
Trajectory  Roads  Predictive models  Planning  Task analysis  Behavioral sciences  Vehicle dynamics  Autonomous driving  trajectory prediction  anchor-based prediction  
Prototypical Context-Aware Dynamics for Generalization in Visual Control With Model-Based Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 页码: 11
作者:  Wang, Junjie;  Zhang, Qichao;  Mu, Yao;  Li, Dong;  Zhao, Dongbin;  Zhuang, Yuzheng;  Luo, Ping;  Wang, Bin;  Hao, Jianye
收藏  |  浏览/下载:15/0  |  提交时间:2024/07/04
Context-aware  dynamics generalization  model-based reinforcement learning  visual control  
Conditional Goal-Oriented Trajectory Prediction for Interacting Vehicles 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:  Li, Ding;  Zhang, Qichao;  Lu, Shuai;  Pan, Yifeng;  Zhao, Dongbin
收藏  |  浏览/下载:167/0  |  提交时间:2023/12/21
Trajectory  Predictive models  Behavioral sciences  Pipelines  Task analysis  Feature extraction  Vehicle dynamics  Conditional prediction  goal-oriented trajectory prediction  hierarchical vectorized representation  joint trajectory prediction  marginal prediction  
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:  Chai, Jiajun;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2469Kb)  |  收藏  |  浏览/下载:64/4  |  提交时间:2023/11/16
Large-scale multiagent  neighboring communication  reinforcement learning (RL)  variational information flow  
Data Generation Feedback Relearning Control for Unmodeled Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2023, 页码: 12
作者:  Zhang, Yong;  Mu, Chaoxu;  Zhao, Dongbin
收藏  |  浏览/下载:112/0  |  提交时间:2023/11/16
Data models  Real-time systems  Heuristic algorithms  Mathematical models  Adaptation models  Approximation algorithms  Cost function  Data generation model  feedback relearning control  delayed neural network  reinforcement learning  unmodeled nonlinear system  
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  收藏  |  浏览/下载:257/14  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 12
作者:  Chai, Jiajun;  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin;  Ma, Zhe;  Sun, Kewu;  Ding, Jishiyu
Adobe PDF(3402Kb)  |  收藏  |  浏览/下载:292/38  |  提交时间:2022/01/27
Multi-agent systems  Training  Task analysis  Reinforcement learning  Sun  Learning systems  Semantics  Centralized training with decentralized execution (CTDE)  multiagent  reinforcement learning  StarCraft II  
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 13
作者:  Hu, Guangzheng;  Zhu, Yuanheng;  Zhao, Dongbin;  Zhao, Mengchen;  Hao, Jianye
Adobe PDF(4187Kb)  |  收藏  |  浏览/下载:267/12  |  提交时间:2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)  
Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target 期刊论文
COMPLEX & INTELLIGENT SYSTEMS, 2021, 页码: 12
作者:  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(1431Kb)  |  收藏  |  浏览/下载:327/62  |  提交时间:2021/12/28
Reinforcement learning  Missile guidance  Auxiliary learning  Self-imitation learning  
Optimal Feedback Control of Pedestrian Flow in Heterogeneous Corridors 期刊论文
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2021, 卷号: 18, 期号: 3, 页码: 1097-1108
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo
Adobe PDF(2666Kb)  |  收藏  |  浏览/下载:239/17  |  提交时间:2021/08/15
Microscopy  Feedback control  Mathematical model  Data models  Dynamic programming  Psychology  Computational modeling  Adaptive dynamic programming (ADP)  heterogeneous corridors  macroscopic pedestrian dynamics  optimal feedback control  pedestrian flow