CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共17条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination 会议论文
, 日本, 2024-6
作者:  Zhang Qingyang;  Xu Bo
Adobe PDF(8862Kb)  |  收藏  |  浏览/下载:22/7  |  提交时间:2024/06/25
强化学习,分层强化学习  
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:38/14  |  提交时间:2024/06/25
强化学习,分层强化学习  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:22/10  |  提交时间:2024/06/25
Shifted Chunk Encoder for Transformer Based Streaming End-to-End ASR 会议论文
, Indore,India, 2022.11.28
作者:  Wang FY(王方圆);  Xu B(徐波)
Adobe PDF(1374Kb)  |  收藏  |  浏览/下载:50/17  |  提交时间:2024/06/13
Learning in bi-level markov games 会议论文
, Padua, Italy, 2022.7.18-2022.7.23
作者:  Meng Linghui;  Ruan Jingqing;  Xing Dengpeng;  Xu Bo
Adobe PDF(1450Kb)  |  收藏  |  浏览/下载:49/20  |  提交时间:2024/06/11
M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文
, London, United Kingdom, 2023.5.29-2023.6.2
作者:  Meng Linghui;  Ruan Jingqing;  Xiong Xuantang;  Li Xiyun;  Zhang Xi;  Xing Dengpeng;  Xu Bo
Adobe PDF(1302Kb)  |  收藏  |  浏览/下载:37/11  |  提交时间:2024/06/11
Filtered Observations for Model-Based Multi-agent Reinforcement Learning 会议论文
, Turin, Italy, 2023.9.18-2023.9.22
作者:  Meng Linghui;  Xiong Xuantang;  Zang Yifan;  Zhang Xi;  Li Guoqi;  Xing Dengpeng;  Xu Bo
Adobe PDF(841Kb)  |  收藏  |  浏览/下载:46/18  |  提交时间:2024/06/11
SA-MPF: A Status-Aware Mask Prediction Framework for Online Disease Diagnosis 会议论文
, Yokohama, Japan, 2024-6-30 - 2023-7-5
作者:  Zefa Hu;  Linghui Meng;  Yunlong Zhao;  Yuanyuan Zhao;  Shuang Xu;  Bo Xu
Adobe PDF(307Kb)  |  收藏  |  浏览/下载:62/13  |  提交时间:2024/05/29
TWO-STAGE PRE-TRAINING FOR SEQUENCE TO SEQUENCE SPEECH RECOGNITION 会议论文
, 线上会议, 2021-7-18
作者:  Fan ZY(范志赟);  Zhou SY(周世玉);  Xu B(徐波)
Adobe PDF(230Kb)  |  收藏  |  浏览/下载:193/51  |  提交时间:2022/09/17
pre-training  speech recognition  encoder-decoder  sequence-to-sequence  
A Working Memory Model for Task-oriented Dialog Response Generation 会议论文
, Florence, Italy, 2019-07
作者:  Chen, Xiuyi;  Xu, Jiaming;  Xu, Bo
Adobe PDF(792Kb)  |  收藏  |  浏览/下载:193/63  |  提交时间:2022/06/27