已选(0)清除
条数/页: 排序方式: |
| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文 , 澳大利亚, 2023-6 作者: Zhang Qingyang ; Yang Yiming ; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(7948Kb)  |   收藏  |  浏览/下载:19/7  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文 Machine Intelligence Research, 2023, 页码: 158 作者: Zhang Qingyang ; Zhang Hongming; Xing Dengpeng ; Bo Xu![](/image/person.jpg)
Adobe PDF(9639Kb)  |   收藏  |  浏览/下载:12/6  |  提交时间:2024/06/25 |
| LEGO: A Multi-agent Collaborative Framework with Role-playing and Iterative Feedback for Causality Explanation Generation 会议论文 , Singapore, 2023-12 作者: Zhitao He ; Pengfei Cao ; Yubo Chen ; Kang Liu; Jun Zhao![](/image/person.jpg)
Adobe PDF(1153Kb)  |   收藏  |  浏览/下载:11/3  |  提交时间:2024/06/25 |
| Improving Generalization of Multi-agent Reinforcement Learning through Domain-Invariant Feature Extraction 会议论文 , Greece, 2023-5 作者: Xu YF(徐一凡) ; Pu ZQ(蒲志强) ; Cai QA(蔡奇昂) ; Li FM(李非墨) ; Chai XH(柴兴华)
Adobe PDF(7610Kb)  |   收藏  |  浏览/下载:11/5  |  提交时间:2024/06/21 |
| P-vectors: A Parallel-Coupled TDNN/Transformer Network for Speaker Verification 会议论文 , Dublin, Ireland, 2023.08.24 作者: Wang XY(王溪源) ; Wang FY(王方圆) ; Xu B(徐波) ; Xu L(徐亮); Xiao J(肖京)
Adobe PDF(1542Kb)  |   收藏  |  浏览/下载:48/12  |  提交时间:2024/06/12 |
| Lead ASR Models to Generalize Better Using Approximated Bias-Variance Tradeof 会议论文 , changsha,China, 2023.11.13 作者: Wang FY(王方圆) ; Ming Hao; Yuhai Shi; Bo Xu![](/image/person.jpg)
Adobe PDF(1933Kb)  |   收藏  |  浏览/下载:38/15  |  提交时间:2024/06/12 |
| M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文 , London, United Kingdom, 2023.5.29-2023.6.2 作者: Meng Linghui ; Ruan Jingqing; Xiong Xuantang; Li Xiyun ; Zhang Xi; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(1302Kb)  |   收藏  |  浏览/下载:23/5  |  提交时间:2024/06/11 |
| Generative Calibration for In-context Learning 会议论文 , Singapore, 2023-10-6 作者: Zhongtao Jiang ; Yuanzhe Zhang ; Cao Liu ; Jun Zhao ; Kang Liu
Adobe PDF(763Kb)  |   收藏  |  浏览/下载:28/10  |  提交时间:2024/06/06 |
| Interpreting Sentiment Composition with Latent Semantic Tree 会议论文 , Toronto, Canada, 2023-7-9 作者: Zhongtao Jiang ; Yuanzhe Zhang ; Cao Liu ; Jiansong Chen; Jun Zhao ; Kang Liu
Adobe PDF(509Kb)  |   收藏  |  浏览/下载:37/17  |  提交时间:2024/06/06 |
| Minimizing Age of Information for Mobile Edge Computing Systems: A Nested Index Approach 会议论文 , Singapore, 2023/8/24-27 作者: Chen,Shuo; Yang,Ning ; Zhang,Meng ; Wang,Jun
Adobe PDF(1413Kb)  |   收藏  |  浏览/下载:41/10  |  提交时间:2024/06/05 |