已选(0)清除
条数/页: 排序方式: |
| Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination 会议论文 , 日本, 2024-6 作者: Zhang Qingyang; Xu Bo Adobe PDF(8862Kb)  |  收藏  |  浏览/下载:22/7  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文 , 澳大利亚, 2023-6 作者: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:38/14  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文 Machine Intelligence Research, 2023, 页码: 158 作者: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:22/10  |  提交时间:2024/06/25 |
| Shifted Chunk Encoder for Transformer Based Streaming End-to-End ASR 会议论文 , Indore,India, 2022.11.28 作者: Wang FY(王方圆); Xu B(徐波) Adobe PDF(1374Kb)  |  收藏  |  浏览/下载:50/17  |  提交时间:2024/06/13 |
| Learning in bi-level markov games 会议论文 , Padua, Italy, 2022.7.18-2022.7.23 作者: Meng Linghui; Ruan Jingqing; Xing Dengpeng; Xu Bo Adobe PDF(1450Kb)  |  收藏  |  浏览/下载:49/20  |  提交时间:2024/06/11 |
| M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文 , London, United Kingdom, 2023.5.29-2023.6.2 作者: Meng Linghui; Ruan Jingqing; Xiong Xuantang; Li Xiyun; Zhang Xi; Xing Dengpeng; Xu Bo Adobe PDF(1302Kb)  |  收藏  |  浏览/下载:37/11  |  提交时间:2024/06/11 |
| Filtered Observations for Model-Based Multi-agent Reinforcement Learning 会议论文 , Turin, Italy, 2023.9.18-2023.9.22 作者: Meng Linghui; Xiong Xuantang; Zang Yifan; Zhang Xi; Li Guoqi; Xing Dengpeng; Xu Bo Adobe PDF(841Kb)  |  收藏  |  浏览/下载:46/18  |  提交时间:2024/06/11 |
| SA-MPF: A Status-Aware Mask Prediction Framework for Online Disease Diagnosis 会议论文 , Yokohama, Japan, 2024-6-30 - 2023-7-5 作者: Zefa Hu; Linghui Meng; Yunlong Zhao; Yuanyuan Zhao; Shuang Xu; Bo Xu Adobe PDF(307Kb)  |  收藏  |  浏览/下载:62/13  |  提交时间:2024/05/29 |
| TWO-STAGE PRE-TRAINING FOR SEQUENCE TO SEQUENCE SPEECH RECOGNITION 会议论文 , 线上会议, 2021-7-18 作者: Fan ZY(范志赟); Zhou SY(周世玉); Xu B(徐波) Adobe PDF(230Kb)  |  收藏  |  浏览/下载:193/51  |  提交时间:2022/09/17 pre-training speech recognition encoder-decoder sequence-to-sequence |
| A Working Memory Model for Task-oriented Dialog Response Generation 会议论文 , Florence, Italy, 2019-07 作者: Chen, Xiuyi; Xu, Jiaming; Xu, Bo Adobe PDF(792Kb)  |  收藏  |  浏览/下载:193/63  |  提交时间:2022/06/27 |