| Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination. Conference paper, Japan, 2024-6. Authors: Zhang Qingyang; Xu Bo. Adobe PDF (8862Kb) | Views/Downloads: 6/3 | Submitted: 2024/06/25. Keywords: reinforcement learning, hierarchical reinforcement learning |
| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs. Conference paper, Australia, 2023-6. Authors: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo. Adobe PDF (7948Kb) | Views/Downloads: 6/3 | Submitted: 2024/06/25. Keywords: reinforcement learning, hierarchical reinforcement learning |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning. Journal article, Machine Intelligence Research, 2023, page: 158. Authors: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu. Adobe PDF (9639Kb) | Views/Downloads: 6/4 | Submitted: 2024/06/25 |
| Lead ASR Models to Generalize Better Using Approximated Bias-Variance Tradeoff. Conference paper, Changsha, China, 2023.11.13. Authors: Wang FY (王方圆); Ming Hao; Yuhai Shi; Bo Xu. Adobe PDF (1933Kb) | Views/Downloads: 27/10 | Submitted: 2024/06/12 |
| Multi-Scale Dynamic Coding Improved Spiking Actor Network for Reinforcement Learning. Conference paper, Online, February 22–March 1, 2022. Authors: Zhang, Duzhen; Zhang, Tielin; Jia, Shuncheng; Xu, Bo. Adobe PDF (2249Kb) | Views/Downloads: 16/7 | Submitted: 2024/06/11 |
| Learning in Bi-level Markov Games. Conference paper, Padua, Italy, 2022.7.18-2022.7.23. Authors: Meng Linghui; Ruan Jingqing; Xing Dengpeng; Xu Bo. Adobe PDF (1450Kb) | Views/Downloads: 18/5 | Submitted: 2024/06/11 |
| M3: Modularization for Multi-task and Multi-agent Offline Pre-training. Conference paper, London, United Kingdom, 2023.5.29-2023.6.2. Authors: Meng Linghui; Ruan Jingqing; Xiong Xuantang; Li Xiyun; Zhang Xi; Xing Dengpeng; Xu Bo. Adobe PDF (1302Kb) | Views/Downloads: 15/3 | Submitted: 2024/06/11 |
| Filtered Observations for Model-Based Multi-agent Reinforcement Learning. Conference paper, Turin, Italy, 2023.9.18-2023.9.22. Authors: Meng Linghui; Xiong Xuantang; Zang Yifan; Zhang Xi; Li Guoqi; Xing Dengpeng; Xu Bo. Adobe PDF (841Kb) | Views/Downloads: 19/8 | Submitted: 2024/06/11 |
| T-Agent: A Term-Aware Agent for Medical Dialogue Generation. Conference paper, Yokohama, Japan, 2024-6-30 - 2024-7-5. Authors: Zefa Hu; Haozhi Zhao; Yuanyuan Zhao; Shuang Xu; Bo Xu. Adobe PDF (483Kb) | Views/Downloads: 28/8 | Submitted: 2024/05/29 |
| SA-MPF: A Status-Aware Mask Prediction Framework for Online Disease Diagnosis. Conference paper, Yokohama, Japan, 2024-6-30 - 2024-7-5. Authors: Zefa Hu; Linghui Meng; Yunlong Zhao; Yuanyuan Zhao; Shuang Xu; Bo Xu. Adobe PDF (307Kb) | Views/Downloads: 32/7 | Submitted: 2024/05/29 |