CASIA OpenIR

浏览/检索结果: 共22条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:22/10  |  提交时间:2024/06/25
Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning 会议论文
Advanced Intelligent Computing Technology and Applications, 中国郑州, 2023-8
作者:  He SQ(何少钦);  Gao Y(高阳);  Zhang BF(张保丰);  Chang H(常惠);  Zhang XC(张鑫辰)
Adobe PDF(1496Kb)  |  收藏  |  浏览/下载:64/21  |  提交时间:2024/05/31
Air Combat, Reinforcement Learning, Neural Fictitious Self-Play.  
T-Agent: A Term-Aware Agent for Medical Dialogue Generation 会议论文
, Yokohama, Japan, 2024-6-30 - 2023-7-5
作者:  Zefa Hu;  Haozhi Zhao;  Yuanyuan Zhao;  Shuang Xu;  Bo Xu
Adobe PDF(483Kb)  |  收藏  |  浏览/下载:56/16  |  提交时间:2024/05/29
Matching-based Term Semantics Pre-training for Spoken Patient Query Understanding 会议论文
, Rhodes, Greece, 2023-6-6 - 2023-6-10
作者:  Zefa Hu;  Xiuyi Chen;  Haoran Wu;  Minglun Han;  Ziyi Ni;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(1049Kb)  |  收藏  |  浏览/下载:63/20  |  提交时间:2024/05/29
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文
, New Orleans, LA, USA, December 10-16, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Guangchong Zhou;  Zeren Zhang;  Guoliang Fan
Adobe PDF(8700Kb)  |  收藏  |  浏览/下载:54/15  |  提交时间:2024/05/28
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Zeren Zhang;  Guangchong Zhou;  Hao Chen;  Guoliang Fan
Adobe PDF(4141Kb)  |  收藏  |  浏览/下载:48/19  |  提交时间:2024/05/28
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning 会议论文
, New Orleans, LA, USA,, November 28 - December 9, 2022
作者:  Zhiwei Xu;  Dapeng Li;  Bin Zhang;  Yuan Zhan;  Yunpeng Bai;  Guoliang Fan
Adobe PDF(4367Kb)  |  收藏  |  浏览/下载:38/7  |  提交时间:2024/05/28
SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning 会议论文
, Auckland, New Zealand, May 9-13, 2022
作者:  Zhiwei Xu;  Yunpeng Bai;  Dapeng Li;  Bin Zhang;  Guoliang Fan
Adobe PDF(2965Kb)  |  收藏  |  浏览/下载:40/7  |  提交时间:2024/05/28
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Feilong Chen;  Duzhen Zhang;  Minglun Han;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(969Kb)  |  收藏  |  浏览/下载:177/34  |  提交时间:2023/06/21
Exploration via Joint Policy Diversity for Sparse-Reward Multi-Agent Tasks 会议论文
, Macao, China, 2023-8
作者:  Pei Xu;  Junge Zhang;  Kaiqi Huang
Adobe PDF(1369Kb)  |  收藏  |  浏览/下载:278/86  |  提交时间:2023/06/19