已选(0)清除
条数/页: 排序方式: |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文 Machine Intelligence Research, 2023, 页码: 158 作者: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:22/10  |  提交时间:2024/06/25 |
| Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning 会议论文 Advanced Intelligent Computing Technology and Applications, 中国郑州, 2023-8 作者: He SQ(何少钦); Gao Y(高阳); Zhang BF(张保丰); Chang H(常惠); Zhang XC(张鑫辰) Adobe PDF(1496Kb)  |  收藏  |  浏览/下载:64/21  |  提交时间:2024/05/31 Air Combat, Reinforcement Learning, Neural Fictitious Self-Play. |
| T-Agent: A Term-Aware Agent for Medical Dialogue Generation 会议论文 , Yokohama, Japan, 2024-6-30 - 2023-7-5 作者: Zefa Hu; Haozhi Zhao; Yuanyuan Zhao; Shuang Xu; Bo Xu Adobe PDF(483Kb)  |  收藏  |  浏览/下载:56/16  |  提交时间:2024/05/29 |
| Matching-based Term Semantics Pre-training for Spoken Patient Query Understanding 会议论文 , Rhodes, Greece, 2023-6-6 - 2023-6-10 作者: Zefa Hu; Xiuyi Chen; Haoran Wu; Minglun Han; Ziyi Ni; Jing Shi; Shuang Xu; Bo Xu Adobe PDF(1049Kb)  |  收藏  |  浏览/下载:63/20  |  提交时间:2024/05/29 |
| Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文 , New Orleans, LA, USA, December 10-16, 2023 作者: Zhiwei Xu; Bin Zhang; Dapeng Li; Guangchong Zhou; Zeren Zhang; Guoliang Fan Adobe PDF(8700Kb)  |  收藏  |  浏览/下载:54/15  |  提交时间:2024/05/28 |
| Consensus Learning for Cooperative Multi-Agent Reinforcement Learning 会议论文 , Washington, DC, USA, February 7-14, 2023 作者: Zhiwei Xu; Bin Zhang; Dapeng Li; Zeren Zhang; Guangchong Zhou; Hao Chen; Guoliang Fan Adobe PDF(4141Kb)  |  收藏  |  浏览/下载:48/19  |  提交时间:2024/05/28 |
| Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning 会议论文 , New Orleans, LA, USA,, November 28 - December 9, 2022 作者: Zhiwei Xu; Dapeng Li; Bin Zhang; Yuan Zhan; Yunpeng Bai; Guoliang Fan Adobe PDF(4367Kb)  |  收藏  |  浏览/下载:38/7  |  提交时间:2024/05/28 |
| SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning 会议论文 , Auckland, New Zealand, May 9-13, 2022 作者: Zhiwei Xu; Yunpeng Bai; Dapeng Li; Bin Zhang; Guoliang Fan Adobe PDF(2965Kb)  |  收藏  |  浏览/下载:40/7  |  提交时间:2024/05/28 |
| VLP: A Survey on Vision-language Pre-training 期刊论文 Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56 作者: Feilong Chen; Duzhen Zhang; Minglun Han; Xiuyi Chen; Jing Shi; Shuang Xu; Bo Xu Adobe PDF(969Kb)  |  收藏  |  浏览/下载:177/34  |  提交时间:2023/06/21 |
| Exploration via Joint Policy Diversity for Sparse-Reward Multi-Agent Tasks 会议论文 , Macao, China, 2023-8 作者: Pei Xu; Junge Zhang; Kaiqi Huang Adobe PDF(1369Kb)  |  收藏  |  浏览/下载:278/86  |  提交时间:2023/06/19 |