| Dynamic-horizon model-based value estimation with latent imagination. Journal article. IEEE Transactions on Neural Networks and Learning Systems, 2022, Pages: 1-14. Authors: Wang JJ (王俊杰); Zhang QC (张启超); Zhao DB (赵冬斌). Adobe PDF (2305 KB) | Views/Downloads: 156/59 | Submitted: 2023/05/30. Keywords: latent world model; model-based value expansion (MVE); reinforcement learning |
| A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV Air-to-Air Combat. Journal article. IEEE Transactions on Systems, Man and Cybernetics: Systems, 2023, DOI: 10.1109/TSMC.2023.3270444. Authors: Jiajun Chai; Wenzhang Chen; Yuanheng Zhu; Zong-xin Yao; Dongbin Zhao. Adobe PDF (9249 KB) | Views/Downloads: 210/112 | Submitted: 2023/04/26 |
| Soft Contrastive Learning with Q-irrelevance Abstraction for Reinforcement Learning. Journal article. IEEE Transactions on Cognitive and Developmental Systems, 2022, DOI: 10.1109/TCDS.2022.3218940. Authors: Minsong Liu; Luntong Li; Shuai Hao; Yuanheng Zhu; Dongbin Zhao. Adobe PDF (12013 KB) | Views/Downloads: 73/19 | Submitted: 2023/04/26 |
| Vision-based control in the open racing car simulator with deep and reinforcement learning. Journal article. Journal of Ambient Intelligence and Humanized Computing, 2019, DOI: 10.1007/s12652-019-01503-y. Authors: Yuanheng Zhu; Dongbin Zhao. Adobe PDF (2210 KB) | Views/Downloads: 46/11 | Submitted: 2023/04/26 |
| Empirical Policy Optimization for n-Player Markov Games. Journal article. IEEE Transactions on Cybernetics, 2022, DOI: 10.1109/TCYB.2022.3179775. Authors: Yuanheng Zhu; Weifan Li; Mengchen Zhao; Jianye Hao; Dongbin Zhao. Adobe PDF (1739 KB) | Views/Downloads: 95/38 | Submitted: 2023/04/26 |
| UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios. Journal article. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 12. Authors: Chai, Jiajun; Li, Weifan; Zhu, Yuanheng; Zhao, Dongbin; Ma, Zhe; Sun, Kewu; Ding, Jishiyu. Adobe PDF (3402 KB) | Views/Downloads: 241/25 | Submitted: 2022/01/27. Keywords: Multi-agent systems; Training; Task analysis; Reinforcement learning; Sun; Learning systems; Semantics; Centralized training with decentralized execution (CTDE); multiagent reinforcement learning; StarCraft II |
| Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target. Journal article. COMPLEX & INTELLIGENT SYSTEMS, 2021, Pages: 12. Authors: Li, Weifan; Zhu, Yuanheng; Zhao, Dongbin. Adobe PDF (1431 KB) | Views/Downloads: 282/51 | Submitted: 2021/12/28. Keywords: Reinforcement learning; Missile guidance; Auxiliary learning; Self-imitation learning |
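| Advances in deep reinforcement learning: from AlphaGo to AlphaGo Zero (深度强化学习进展: 从 AlphaGo 到 AlphaGo Zero). Journal article. Control Theory & Applications (控制理论与应用), 2017, Volume: 34, Issue: 12, Pages: 1529-1546. Authors: 唐振韬; 邵坤; 赵冬斌; 朱圆恒. Adobe PDF (8232 KB) | Views/Downloads: 225/34 | Submitted: 2021/07/05. Keywords: deep reinforcement learning; AlphaGo Zero; deep learning; reinforcement learning; artificial intelligence |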
| A Spatial-Temporal Attention Model for Human Trajectory Prediction. Journal article. IEEE/CAA Journal of Automatica Sinica, 2020, Volume: 7, Issue: 4, Pages: 965-974. Authors: Xiaodong Zhao; Yaran Chen; Jin Guo; Dongbin Zhao. Adobe PDF (42191 KB) | Views/Downloads: 113/30 | Submitted: 2021/03/11. Keywords: Attention mechanism; long-short term memory (LSTM); spatial-temporal model; trajectory prediction |
| Deep Reinforcement Learning-Based Automatic Exploration for Navigation in Unknown Environment. Journal article. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, Volume: 31, Issue: 6, Pages: 2064-2076. Authors: Li, Haoran; Zhang, Qichao; Zhao, Dongbin. Adobe PDF (4274 KB) | Views/Downloads: 369/114 | Submitted: 2020/08/03. Keywords: Robot sensing systems; Navigation; Entropy; Neural networks; Task analysis; Planning; Automatic exploration; deep reinforcement learning (DRL); optimal decision; partial observation |