CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共102条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Multi-Agent Reinforcement Learning Based on Clustering in Two-Player Games 会议论文
, Xiamen, China, 2019-12-6
作者:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(488Kb)  |  收藏  |  浏览/下载:94/33  |  提交时间:2023/06/28
reinforcement learning  unsupervised clustering  matrix game  
BViT: Broad Attention-Based Vision Transformer 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2023, 页码: 1 - 12
作者:  Nannan Li;  Yaran Chen;  Weifan Li;  Zixiang Ding;  Dongbin Zhao;  Shuai Nie
Adobe PDF(2171Kb)  |  收藏  |  浏览/下载:162/47  |  提交时间:2023/06/27
Broad attention  broad connection  image classification  parameter-free attention  vision transformer  
Dynamic-horizon model-based value estimation with latent imagination 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2022, 页码: 1-14
作者:  Wang JJ(王俊杰);  Zhang QC(张启超);  Zhao DB(赵冬斌)
Adobe PDF(2305Kb)  |  收藏  |  浏览/下载:131/53  |  提交时间:2023/05/30
Latent world model  model-based value expansion (MVE)  reinforcement learning  reinforcement learning  
Optimal Pedestrian Evacuation in Building with Consecutive Differential Dynamic Programming 会议论文
, Budapest, Hungary, 2019-7-14
作者:  Zhu YH(朱圆恒);  Haibo He;  Dongbin Zhao;  Zhongsheng Hou
Adobe PDF(679Kb)  |  收藏  |  浏览/下载:52/27  |  提交时间:2023/05/22
A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV Air-to-Air Combat 期刊论文
IEEE Transactions on Systems, Man and Cybernetics: Systems, 2023, 页码: DOI: 10.1109/TSMC.2023.3270444
作者:  Jiajun Chai;  Wenzhang Chen;  Yuanheng Zhu;  Zong-xin Yao,;  Dongbin Zhao
Adobe PDF(9249Kb)  |  收藏  |  浏览/下载:195/106  |  提交时间:2023/04/26
Soft Contrastive Learning with Q-irrelevance Abstraction for Reinforcement Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2022, 页码: doi={10.1109/TCDS.2022.3218940}
作者:  Minsong Liu;  Luntong Li;  Shuai Hao;  Yuanheng Zhu;  Dongbin Zhao
Adobe PDF(12013Kb)  |  收藏  |  浏览/下载:64/16  |  提交时间:2023/04/26
Vision-based control in the open racing car simulator with deep and reinforcement learning 期刊论文
Journal of Ambient Intelligence and Humanized Computing, 2019, 页码: doi={10.1007/s12652-019-01503-y}
作者:  Yuanheng Zhu;  Dongbin Zhao
Adobe PDF(2210Kb)  |  收藏  |  浏览/下载:39/8  |  提交时间:2023/04/26
Empirical Policy Optimization for n-Player Markov Games 期刊论文
IEEE Transactions on Cybernetics, 2022, 页码: doi={10.1109/TCYB.2022.3179775}
作者:  Yuanheng Zhu;  Weifan Li;  Mengchen Zhao;  Jianye Hao;  Dongbin Zhao
Adobe PDF(1739Kb)  |  收藏  |  浏览/下载:87/35  |  提交时间:2023/04/26
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
收藏  |  浏览/下载:185/0  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Boost 3-D Object Detection via Point Clouds Segmentation and Fused 3-D GIoU-L-1 Loss 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 2, 页码: 762-773
作者:  Chen, Yaran;  Li, Haoran;  Gao, Ruiyuan;  Zhao, Dongbin
Adobe PDF(2082Kb)  |  收藏  |  浏览/下载:211/42  |  提交时间:2022/03/17
3-D object detection  generalized Intersection of Union (GIoU) loss  segmentation