已选(0)清除
条数/页: 排序方式: |
| Offline Hierarchical Reinforcement Learning: Enable Large-Scale Training in HRL 会议论文 , Nanjing, 2023-11-27 作者: Yuqiao Wu ; Haifeng Zhang; Jun Wang
Adobe PDF(1339Kb)  |   收藏  |  浏览/下载:26/8  |  提交时间:2024/07/12 |
| AdaNSP: Uncertainty-driven Adaptive Decoding in Neural Semantic Parsing 会议论文 Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy, 2019-07 作者: Zhang X(张翔) ; He SZ(何世柱) ; Liu K(刘康) ; Zhao J(赵军)![](/image/person.jpg)
Adobe PDF(400Kb)  |   收藏  |  浏览/下载:32/9  |  提交时间:2024/06/26 |
| Online Optimization of Normalized CPGs for a Multi-Joint Robotic Fish 会议论文 , 中国,上海, 2021年7月 作者: Tong R(仝茹) ; Wu ZX(吴正兴) ; Wang J(王健); Tan M(谭民) ; Yu JZ(喻俊志)![](/image/person.jpg)
Adobe PDF(456Kb)  |   收藏  |  浏览/下载:32/18  |  提交时间:2024/06/26 |
| Improving Generalization of Multi-agent Reinforcement Learning through Domain-Invariant Feature Extraction 会议论文 , Greece, 2023-5 作者: Xu YF(徐一凡) ; Pu ZQ(蒲志强) ; Cai QA(蔡奇昂) ; Li FM(李非墨) ; Chai XH(柴兴华)
Adobe PDF(7610Kb)  |   收藏  |  浏览/下载:28/12  |  提交时间:2024/06/21 |
| A Double-Observation Policy Learning Framework for Multi-target Coverage with Connectivity Maintenance 会议论文 , online, 2022-2 作者: Xu YF(徐一凡) ; Pu ZQ(蒲志强) ; Wu SG(吴士广) ; Liu BY(刘博寅); Yi JQ(易建强) ; Geng HJ(耿虎军); Chai XH(柴兴华)
Adobe PDF(9582Kb)  |   收藏  |  浏览/下载:25/8  |  提交时间:2024/06/21 |
| MoDE-CoTD: Chain-of-Thought Distillation for Complex Reasoning Tasks with Mixture of Decoupled LoRA-Experts 会议论文 , Torino (Italia), 2024.5.20 - 2024.5.25 作者: Xiang Li ; Shizhu He ; Jiayu Wu; Zhao Yang; Yao Xu; Yang Jun; Haifeng Liu; Kang Liu; Jun Zhao![](/image/person.jpg)
Adobe PDF(1062Kb)  |   收藏  |  浏览/下载:40/11  |  提交时间:2024/06/20 |
| Multi-Scale Dynamic Coding Improved Spiking Actor Network for Reinforcement Learning 会议论文 , Online, February 22–March 1, 2022 作者: Zhang, Duzhen ; Zhang, Tielin ; Jia, Shuncheng; Xu, Bo![](/image/person.jpg)
Adobe PDF(2249Kb)  |   收藏  |  浏览/下载:43/15  |  提交时间:2024/06/11 |
| Learning in bi-level markov games 会议论文 , Padua, Italy, 2022.7.18-2022.7.23 作者: Meng Linghui ; Ruan Jingqing; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(1450Kb)  |   收藏  |  浏览/下载:50/21  |  提交时间:2024/06/11 |
| M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文 , London, United Kingdom, 2023.5.29-2023.6.2 作者: Meng Linghui ; Ruan Jingqing; Xiong Xuantang; Li Xiyun ; Zhang Xi; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(1302Kb)  |   收藏  |  浏览/下载:39/11  |  提交时间:2024/06/11 |
| Filtered Observations for Model-Based Multi-agent Reinforcement Learning 会议论文 , Turin, Italy, 2023.9.18-2023.9.22 作者: Meng Linghui ; Xiong Xuantang; Zang Yifan; Zhang Xi; Li Guoqi ; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(841Kb)  |   收藏  |  浏览/下载:50/19  |  提交时间:2024/06/11 |