CASIA OpenIR

浏览/检索结果: 共9条,第1-9条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:4/3  |  提交时间:2024/06/25
强化学习,分层强化学习  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:5/3  |  提交时间:2024/06/25
Learning Superior Cooperative Policy in Competitive Multi-team Reinforcement Learning 会议论文
, Gold Coast, Australia, 2023-6
作者:  Qingxu Fu;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi;  Xiaolin Ai;  Wanmai Yuan
Adobe PDF(25675Kb)  |  收藏  |  浏览/下载:21/3  |  提交时间:2024/06/05
Improve the efficiency of deep reinforcement learning through semantic exploration guided by natural language. 会议论文
, 北京华腾美居酒店, 2023-12-9
作者:  Zhourui Guo;  Meng Yao;  Yang Yu;  Qiyue Yin
Adobe PDF(2302Kb)  |  收藏  |  浏览/下载:10/5  |  提交时间:2024/06/03
Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning 会议论文
Advanced Intelligent Computing Technology and Applications, 中国郑州, 2023-8
作者:  He SQ(何少钦);  Gao Y(高阳);  Zhang BF(张保丰);  Chang H(常惠);  Zhang XC(张鑫辰)
Adobe PDF(1496Kb)  |  收藏  |  浏览/下载:27/11  |  提交时间:2024/05/31
Air Combat, Reinforcement Learning, Neural Fictitious Self-Play.  
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文
, New Orleans, LA, USA, December 10-16, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Guangchong Zhou;  Zeren Zhang;  Guoliang Fan
Adobe PDF(8700Kb)  |  收藏  |  浏览/下载:24/5  |  提交时间:2024/05/28
A Multi-modal Global Instance Tracking Benchmark (MGIT): Better Locating Target in Complex Spatio-temporal and causal Relationship 会议论文
, New Orleans, 2023-12
作者:  Shiyu, Hu;  Dailing, Zhang;  Meiqi, Wu;  Xiaokun, Feng;  Xuchen, Li;  Xin, Zhao;  Kaiqi, Huang
Adobe PDF(6215Kb)  |  收藏  |  浏览/下载:104/22  |  提交时间:2024/01/22
TBERT: Dynamic BERT Inference with Top-k Based Predictors 会议论文
, Antwerp, Belgium, 2023-4-17
作者:  Liu, Zejian;  Zhao, Kun;  Cheng, Jian
Adobe PDF(3426Kb)  |  收藏  |  浏览/下载:104/27  |  提交时间:2023/06/19
Transformer  Dynamic Inference  Pruning  
Learning Cooperative Policies with Graph Networks in Distributed Swarm Systems 会议论文
, Queensland, Australia, June 18-23, 2023
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强);  Ai XL(艾晓琳);  Yuan GM(袁莞迈)
Adobe PDF(612Kb)  |  收藏  |  浏览/下载:176/51  |  提交时间:2023/06/12