已选(0)清除
条数/页: 排序方式: |
| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文 , 澳大利亚, 2023-6 作者: Zhang Qingyang ; Yang Yiming ; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(7948Kb)  |   收藏  |  浏览/下载:4/3  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文 Machine Intelligence Research, 2023, 页码: 158 作者: Zhang Qingyang ; Zhang Hongming; Xing Dengpeng ; Bo Xu![](/image/person.jpg)
Adobe PDF(9639Kb)  |   收藏  |  浏览/下载:5/3  |  提交时间:2024/06/25 |
| LEGO: A Multi-agent Collaborative Framework with Role-playing and Iterative Feedback for Causality Explanation Generation 会议论文 , Singapore, 2023-12 作者: Zhitao He ; Pengfei Cao ; Yubo Chen ; Kang Liu; Jun Zhao![](/image/person.jpg)
Adobe PDF(1153Kb)  |   收藏  |  浏览/下载:1/1  |  提交时间:2024/06/25 |
| Improving Generalization of Multi-agent Reinforcement Learning through Domain-Invariant Feature Extraction 会议论文 , Greece, 2023-5 作者: Xu YF(徐一凡) ; Pu ZQ(蒲志强) ; Cai QA(蔡奇昂) ; Li FM(李非墨) ; Chai XH(柴兴华)
Adobe PDF(7610Kb)  |   收藏  |  浏览/下载:5/3  |  提交时间:2024/06/21 |
| Bidirectional Sentence Ordering with Interactive Decoding 期刊论文 ACM Transactions on Asian and Low-Resource Language Information Processing, 2023, 卷号: 22, 期号: 2, 页码: 1-15 作者: Guirong Bai ; Shizhu HE ; Kang Liu ; Jun Zhao![](/image/person.jpg)
Adobe PDF(1080Kb)  |   收藏  |  浏览/下载:13/4  |  提交时间:2024/06/20 |
| Prediction and Calibration: Complex Reasoning over Knowledge Graph with Bi-directional Directed Acyclic Graph Neural Network 会议论文 , Toronto, Canada, 2023.07.09-2023.07.14 作者: Yao Xu ; Shizhu HE ; Li Cai; Kang Liu; Jun Zhao![](/image/person.jpg)
Adobe PDF(628Kb)  |   收藏  |  浏览/下载:9/4  |  提交时间:2024/06/20 |
| Query2Triple: Unified Query Encoding for Answering Diverse Complex Queries over Knowledge Graphs 会议论文 , Singapore, 2023.11.06-2023.11.10 作者: Yao Xu ; Shizhu HE ; Cunguang Wang; Li Cai; Kang Liu; Jun Zhao![](/image/person.jpg)
Adobe PDF(811Kb)  |   收藏  |  浏览/下载:6/2  |  提交时间:2024/06/20 |
| P-vectors: A Parallel-Coupled TDNN/Transformer Network for Speaker Verification 会议论文 , Dublin, Ireland, 2023.08.24 作者: Wang XY(王溪源) ; Wang FY(王方圆) ; Xu B(徐波) ; Xu L(徐亮); Xiao J(肖京)
Adobe PDF(1542Kb)  |   收藏  |  浏览/下载:34/9  |  提交时间:2024/06/12 |
| Lead ASR Models to Generalize Better Using Approximated Bias-Variance Tradeof 会议论文 , changsha,China, 2023.11.13 作者: Wang FY(王方圆) ; Ming Hao; Yuhai Shi; Bo Xu![](/image/person.jpg)
Adobe PDF(1933Kb)  |   收藏  |  浏览/下载:26/10  |  提交时间:2024/06/12 |
| M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文 , London, United Kingdom, 2023.5.29-2023.6.2 作者: Meng Linghui ; Ruan Jingqing; Xiong Xuantang; Li Xiyun ; Zhang Xi ; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(1302Kb)  |   收藏  |  浏览/下载:12/3  |  提交时间:2024/06/11 |