CASIA OpenIR
(This search is based on the user's claimed works)

Browse/Search results: 20 items in total, showing items 1-10

Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs [Conference paper]
, Australia, 2023-6
Authors: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo
Adobe PDF(7948Kb) | Views/Downloads: 30/12 | Submitted: 2024/06/25
Reinforcement learning, hierarchical reinforcement learning
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning [Journal article]
Machine Intelligence Research, 2023, Pages: 158
Authors: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu
Adobe PDF(9639Kb) | Views/Downloads: 16/9 | Submitted: 2024/06/25
Lead ASR Models to Generalize Better Using Approximated Bias-Variance Tradeoff [Conference paper]
, Changsha, China, 2023.11.13
Authors: Wang FY (王方圆); Ming Hao; Yuhai Shi; Bo Xu
Adobe PDF(1933Kb) | Views/Downloads: 44/17 | Submitted: 2024/06/12
A New Pre-Training Paradigm for Offline Multi-Agent Reinforcement Learning with Suboptimal Data [Conference paper]
, Seoul, Korea, 2024.4.14-2024.4.19
Authors: Meng Linghui; Zhang Xi; Xing Dengpeng; Xu Bo
Adobe PDF(964Kb) | Views/Downloads: 38/14 | Submitted: 2024/06/11
T-Agent: A Term-Aware Agent for Medical Dialogue Generation [Conference paper]
, Yokohama, Japan, 2024-6-30 - 2024-7-5
Authors: Zefa Hu; Haozhi Zhao; Yuanyuan Zhao; Shuang Xu; Bo Xu
Adobe PDF(483Kb) | Views/Downloads: 52/15 | Submitted: 2024/05/29
Enhancing Multi-agent Coordination via Dual-channel Consensus [Journal article]
Machine Intelligence Research, 2024, Volume: 21, Issue: 2, Pages: 349-368
Authors: Qingyang Zhang; Kaishen Wang; Jingqing Ruan; Yiming Yang; Dengpeng Xing; Bo Xu
Adobe PDF(4997Kb) | Views/Downloads: 65/23 | Submitted: 2024/04/23
Multi-agent reinforcement learning, contrastive representation learning, consensus, multi-agent cooperation, cognitive consistency  
Offline Pre-trained Multi-agent Decision Transformer [Journal article]
Machine Intelligence Research, 2023, Volume: 20, Issue: 2, Pages: 233-248
Authors: Linghui Meng; Muning Wen; Chenyang Le; Xiyun Li; Dengpeng Xing; Weinan Zhang; Ying Wen; Haifeng Zhang; Jun Wang; Yaodong Yang; Bo Xu
Adobe PDF(2121Kb) | Views/Downloads: 54/13 | Submitted: 2024/04/23
Pre-training model, multi-agent reinforcement learning (MARL), decision making, transformer, offline reinforcement learning
Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition [Conference paper]
, Washington D.C., USA, 2023-2-9
Authors: Qingyu Wang; Tielin Zhang; Minglun Han; Yi Wang; Duzhen Zhang; Bo Xu
Adobe PDF(1714Kb) | Views/Downloads: 180/51 | Submitted: 2023/06/20
IMPROVING CROSS-MODAL UNDERSTANDING IN VISUAL DIALOG VIA CONTRASTIVE LEARNING [Conference paper]
, Singapore, 2022.5
Authors: Feilong Chen; Duzhen Zhang; Xiuyi Chen; Jing Shi; Shuang Xu; Bo Xu
Adobe PDF(9035Kb) | Views/Downloads: 235/99 | Submitted: 2023/06/07
Unsupervised and Pseudo-Supervised Vision-Language Alignment in Visual Dialog [Conference paper]
, Lisboa, Portugal, October 10–14, 2022
Authors: Feilong Chen; Duzhen Zhang; Xiuyi Chen; Jing Shi; Shuang Xu; Bo Xu
Adobe PDF(9035Kb) | Views/Downloads: 276/154 | Submitted: 2023/06/05