已选(0)清除
条数/页: 排序方式: |
| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文 , 澳大利亚, 2023-6 作者: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:30/12  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文 Machine Intelligence Research, 2023, 页码: 158 作者: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:16/9  |  提交时间:2024/06/25 |
| Lead ASR Models to Generalize Better Using Approximated Bias-Variance Tradeof 会议论文 , changsha,China, 2023.11.13 作者: Wang FY(王方圆); Ming Hao; Yuhai Shi; Bo Xu Adobe PDF(1933Kb)  |  收藏  |  浏览/下载:44/17  |  提交时间:2024/06/12 |
| A New Pre-Training Paradigm for Offline Multi-Agent Reinforcement Learning with Suboptimal Data 会议论文 , Seoul, Korea, 2024.4.14-2024.4.19 作者: Meng Linghui; Zhang Xi; Xing Dengpeng; Xu Bo Adobe PDF(964Kb)  |  收藏  |  浏览/下载:38/14  |  提交时间:2024/06/11 |
| T-Agent: A Term-Aware Agent for Medical Dialogue Generation 会议论文 , Yokohama, Japan, 2024-6-30 - 2023-7-5 作者: Zefa Hu; Haozhi Zhao; Yuanyuan Zhao; Shuang Xu; Bo Xu Adobe PDF(483Kb)  |  收藏  |  浏览/下载:52/15  |  提交时间:2024/05/29 |
| Enhancing Multi-agent Coordination via Dual-channel Consensus 期刊论文 Machine Intelligence Research, 2024, 卷号: 21, 期号: 2, 页码: 349-368 作者: Qingyang Zhang; Kaishen Wang; Jingqing Ruan; Yiming Yang; Dengpeng Xing; Bo Xu Adobe PDF(4997Kb)  |  收藏  |  浏览/下载:65/23  |  提交时间:2024/04/23 Multi-agent reinforcement learning, contrastive representation learning, consensus, multi-agent cooperation, cognitive consistency |
| Offline Pre-trained Multi-agent Decision Transformer 期刊论文 Machine Intelligence Research, 2023, 卷号: 20, 期号: 2, 页码: 233-248 作者: Linghui Meng; Muning Wen; Chenyang Le; Xiyun Li; Dengpeng Xing; Weinan Zhang; Ying Wen; Haifeng Zhang; Jun Wang; Yaodong Yang; Bo Xu Adobe PDF(2121Kb)  |  收藏  |  浏览/下载:54/13  |  提交时间:2024/04/23 Pre-training model multi-agent reinforcement learning (MARL) decision making transformer offline reinforcement
learning |
| Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition 会议论文 , Washington D.C., USA, 2023-2-9 作者: Qingyu Wang; Tielin Zhang; Minglun Han; Yi Wang; Duzhen Zhang; Bo Xu Adobe PDF(1714Kb)  |  收藏  |  浏览/下载:180/51  |  提交时间:2023/06/20 |
| IMPROVING CROSS-MODAL UNDERSTANDING IN VISUAL DIALOG VIA CONTRASTIVE LEARNING 会议论文 , Singapore, 2022.5 作者: Feilong Chen; Duzhen Zhang; Xiuyi Chen; Jing Shi; Shuang Xu; Bo Xu Adobe PDF(9035Kb)  |  收藏  |  浏览/下载:235/99  |  提交时间:2023/06/07 |
| Unsupervised and Pseudo-Supervised Vision-Language Alignment in Visual Dialog 会议论文 , Lisboa, Portugal, October 10–14, 2022 作者: Feilong Chen; Duzhen Zhang; Xiuyi Chen; Jing Shi; Shang Xu; Bo Xu Adobe PDF(9035Kb)  |  收藏  |  浏览/下载:276/154  |  提交时间:2023/06/05 |