| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs. Conference paper, Australia, 2023-6. Authors: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo. Adobe PDF (7948Kb)  |  Views/Downloads: 44/17  |  Submitted: 2024/06/25  |  Keywords: reinforcement learning, hierarchical reinforcement learning |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning. Journal article, Machine Intelligence Research, 2023, Pages: 158. Authors: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu. Adobe PDF (9639Kb)  |  Views/Downloads: 26/12  |  Submitted: 2024/06/25 |
| Matching-based Term Semantics Pre-training for Spoken Patient Query Understanding. Conference paper, Rhodes, Greece, 2023-6-6 - 2023-6-10. Authors: Zefa Hu; Xiuyi Chen; Haoran Wu; Minglun Han; Ziyi Ni; Jing Shi; Shuang Xu; Bo Xu. Adobe PDF (1049Kb)  |  Views/Downloads: 69/23  |  Submitted: 2024/05/29 |
| VLP: A Survey on Vision-language Pre-training. Journal article, Machine Intelligence Research, 2023, Volume: 20, Issue: 1, Pages: 38-56. Authors: Feilong Chen; Duzhen Zhang; Minglun Han; Xiuyi Chen; Jing Shi; Shuang Xu; Bo Xu. Adobe PDF (969Kb)  |  Views/Downloads: 182/37  |  Submitted: 2023/06/21 |
| Knowledge Transfer from Pre-Trained Language Models to CIF-Based Speech Recognizers via Hierarchical Distillation. Conference paper, Dublin, Ireland, 2023-8-20. Authors: Minglun Han; Feilong Chen; Jing Shi; Shuang Xu; Bo Xu. Adobe PDF (563Kb)  |  Views/Downloads: 204/75  |  Submitted: 2023/06/20 |