CASIA OpenIR
(This search is based on the user's claimed works)

Browse/search results: 30 items in total, showing items 1-10

Prototypical context-aware dynamics generalization for high-dimensional model-based reinforcement learning (Conference paper)
Kigali City, Rwanda, Africa, 2023-5-5
Authors: Junjie, Wang; Yao, Mu; Dong, Li; Qichao, Zhang; Dongbin, Zhao; Yuzheng, Zhuang; Ping, Luo; Bin, Wang; Jianye, Hao
Adobe PDF(3492Kb) | Views/Downloads: 144/39 | Submitted: 2023/06/29
Multi-Agent Reinforcement Learning Based on Clustering in Two-Player Games (Conference paper)
Xiamen, China, 2019-12-6
Authors: Li WF(李伟凡); Zhu YH(朱圆恒); Zhao DB(赵冬斌)
Adobe PDF(488Kb) | Views/Downloads: 130/42 | Submitted: 2023/06/28
reinforcement learning  unsupervised clustering  matrix game  
Dynamic-horizon model-based value estimation with latent imagination (Journal article)
IEEE Transactions on Neural Networks and Learning Systems, 2022, Pages: 1-14
Authors: Wang JJ(王俊杰); Zhang QC(张启超); Zhao DB(赵冬斌)
Adobe PDF(2305Kb) | Views/Downloads: 168/61 | Submitted: 2023/05/30
Latent world model  model-based value expansion (MVE)  reinforcement learning
UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 12
Authors: Chai, Jiajun; Li, Weifan; Zhu, Yuanheng; Zhao, Dongbin; Ma, Zhe; Sun, Kewu; Ding, Jishiyu
Adobe PDF(3402Kb) | Views/Downloads: 254/28 | Submitted: 2022/01/27
Multi-agent systems  Training  Task analysis  Reinforcement learning  Sun  Learning systems  Semantics  Centralized training with decentralized execution (CTDE)  multiagent  reinforcement learning  StarCraft II  
BNAS-v2: Memory-efficient and Performance-collapse-prevented Broad Neural Architecture Search (Journal article)
IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS: SYSTEMS, 2022, Volume: 0, Issue: 0, Pages: 0
Authors: Zixiang, Ding; Yaran, Chen; Nannan, Li; Dongbin, Zhao
Adobe PDF(7657Kb) | Views/Downloads: 206/53 | Submitted: 2022/01/07
Broad neural architecture search (BNAS), continuous relaxation, confident learning rate, partial channel connections, image classification.  
BNAS: Efficient Neural Architecture Search Using Broad Scalable Architecture (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, Issue: 0, Pages: 0
Authors: Ding ZX(丁子祥); Yaran, Chen; Nannan, Li; Dongbin, Zhao; Zhiquan, Sun; C. L. Philip Chen
Adobe PDF(2713Kb) | Views/Downloads: 182/42 | Submitted: 2022/01/06
Broad convolutional neural network (BCNN), image classification, neural architecture search (NAS), reinforcement learning (RL)  
Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target (Journal article)
COMPLEX & INTELLIGENT SYSTEMS, 2021, Pages: 12
Authors: Li, Weifan; Zhu, Yuanheng; Zhao, Dongbin
Adobe PDF(1431Kb) | Views/Downloads: 292/54 | Submitted: 2021/12/28
Reinforcement learning  Missile guidance  Auxiliary learning  Self-imitation learning  
A Spatial-Temporal Attention Model for Human Trajectory Prediction (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2020, Volume: 7, Issue: 4, Pages: 965-974
Authors: Xiaodong Zhao; Yaran Chen; Jin Guo; Dongbin Zhao
Adobe PDF(42191Kb) | Views/Downloads: 122/31 | Submitted: 2021/03/11
Attention mechanism  long-short term memory (LSTM)  spatial-temporal model  trajectory prediction  
MGRL: Graph neural network based inference in a Markov network with Reinforcement Learning for visual navigation (Journal article)
Neurocomputing, 2021, Volume: 0, Issue: 0, Pages: 0
Authors: Lu, Yi; Chen, Yaran; Zhao, Dongbin; Li, Dong
Adobe PDF(976Kb) | Views/Downloads: 258/75 | Submitted: 2020/10/19
Visual navigation, graph neural network, Markov network, reinforcement learning, probabilistic graph model  
Deep Reinforcement Learning-Based Automatic Exploration for Navigation in Unknown Environment (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, Volume: 31, Issue: 6, Pages: 2064-2076
Authors: Li, Haoran; Zhang, Qichao; Zhao, Dongbin
Adobe PDF(4274Kb) | Views/Downloads: 383/117 | Submitted: 2020/08/03
Robot sensing systems  Navigation  Entropy  Neural networks  Task analysis  Planning  Automatic exploration  deep reinforcement learning (DRL)  optimal decision  partial observation