已选(0)清除
条数/页: 排序方式: |
| QFuture: Learning Future Expectation Cognition in Multi-Agent Reinforcement Learning 期刊论文 IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 12 作者: Liu BY(刘博寅)![](/image/person.jpg)
Adobe PDF(6675Kb)  |   收藏  |  浏览/下载:29/5  |  提交时间:2024/07/12 |
| Offline Hierarchical Reinforcement Learning: Enable Large-Scale Training in HRL 会议论文 , Nanjing, 2023-11-27 作者: Yuqiao Wu ; Haifeng Zhang; Jun Wang
Adobe PDF(1339Kb)  |   收藏  |  浏览/下载:33/10  |  提交时间:2024/07/12 |
| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文 , 澳大利亚, 2023-6 作者: Zhang Qingyang ; Yang Yiming ; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(7948Kb)  |   收藏  |  浏览/下载:45/17  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文 Machine Intelligence Research, 2023, 页码: 158 作者: Zhang Qingyang ; Zhang Hongming; Xing Dengpeng ; Bo Xu![](/image/person.jpg)
Adobe PDF(9639Kb)  |   收藏  |  浏览/下载:29/13  |  提交时间:2024/06/25 |
| Zero-Shot Cross-Lingual Document-Level Event Causality Identification with Heterogeneous Graph Contrastive Transfer Learning 会议论文 , Torino, Italia, 2024-5 作者: Zhitao He ; Pengfei Cao ; Zhuoran Jin ; Yubo Chen ; Kang Liu; Jun Zhao![](/image/person.jpg)
Adobe PDF(794Kb)  |   收藏  |  浏览/下载:50/25  |  提交时间:2024/06/25 |
| CLUSTER CONSTRAINTBASEDSPARSENMFFORHYPERSPECTRALIMAGERY UNMIXING 会议论文 , 法国巴黎, 10月27-30日 作者: Jiang XW(蒋心为) ; Ma L(马雷) ; Yang YP(杨一平)![](/image/person.jpg)
Adobe PDF(261Kb)  |   收藏  |  浏览/下载:37/17  |  提交时间:2024/06/24 |
| MoDE-CoTD: Chain-of-Thought Distillation for Complex Reasoning Tasks with Mixture of Decoupled LoRA-Experts 会议论文 , Torino (Italia), 2024.5.20 - 2024.5.25 作者: Xiang Li ; Shizhu He ; Jiayu Wu; Zhao Yang; Yao Xu; Yang Jun; Haifeng Liu; Kang Liu; Jun Zhao![](/image/person.jpg)
Adobe PDF(1062Kb)  |   收藏  |  浏览/下载:44/12  |  提交时间:2024/06/20 |
| Teaching Small Language Models to Reason for Knowledge-Intensive Multi-Hop Question Answering 会议论文 , Bangkok, Thailand, 2024.08.11-2024.08.16 作者: Xiang Li ; Shizhu HE ; Fangyu Lei; Jun Yang; Tianhuang Su; Kang Liu ; Jun Zhao![](/image/person.jpg)
Adobe PDF(873Kb)  |   收藏  |  浏览/下载:51/18  |  提交时间:2024/06/20 |
| Shifted Chunk Encoder for Transformer Based Streaming End-to-End ASR 会议论文 , Indore,India, 2022.11.28 作者: Wang FY(王方圆) ; Xu B(徐波)![](/image/person.jpg)
Adobe PDF(1374Kb)  |   收藏  |  浏览/下载:59/20  |  提交时间:2024/06/13 |
| Lead ASR Models to Generalize Better Using Approximated Bias-Variance Tradeof 会议论文 , changsha,China, 2023.11.13 作者: Wang FY(王方圆) ; Ming Hao; Yuhai Shi; Bo Xu![](/image/person.jpg)
Adobe PDF(1933Kb)  |   收藏  |  浏览/下载:64/23  |  提交时间:2024/06/12 |