已选(0)清除
条数/页: 排序方式: |
| MoDE-CoTD: Chain-of-Thought Distillation for Complex Reasoning Tasks with Mixture of Decoupled LoRA-Experts 会议论文 , Torino (Italia), 2024.5.20 - 2024.5.25 作者: Xiang Li; Shizhu He; Jiayu Wu; Zhao Yang; Yao Xu; Yang Jun; Haifeng Liu; Kang Liu; Jun Zhao Adobe PDF(1062Kb)  |  收藏  |  浏览/下载:4/2  |  提交时间:2024/06/20 |
| Teaching Small Language Models to Reason for Knowledge-Intensive Multi-Hop Question Answering 会议论文 , Bangkok, Thailand, 2024.08.11-2024.08.16 作者: Xiang Li; Shizhu HE; Fangyu Lei; Jun Yang; Tianhuang Su; Kang Liu; Jun Zhao Adobe PDF(873Kb)  |  收藏  |  浏览/下载:12/4  |  提交时间:2024/06/20 |
| Shifted Chunk Encoder for Transformer Based Streaming End-to-End ASR 会议论文 , Indore,India, 2022.11.28 作者: Wang FY(王方圆); Xu B(徐波) Adobe PDF(1374Kb)  |  收藏  |  浏览/下载:17/5  |  提交时间:2024/06/13 |
| M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文 , London, United Kingdom, 2023.5.29-2023.6.2 作者: Meng Linghui; Ruan Jingqing; Xiong Xuantang; Li Xiyun; Zhang Xi; Xing Dengpeng; Xu Bo Adobe PDF(1302Kb)  |  收藏  |  浏览/下载:10/3  |  提交时间:2024/06/11 |
| Filtered Observations for Model-Based Multi-agent Reinforcement Learning 会议论文 , Turin, Italy, 2023.9.18-2023.9.22 作者: Meng Linghui; Xiong Xuantang; Zang Yifan; Zhang Xi; Li Guoqi; Xing Dengpeng; Xu Bo Adobe PDF(841Kb)  |  收藏  |  浏览/下载:15/7  |  提交时间:2024/06/11 |
| Alignment Rationale for Natural Language Inference 会议论文 , Online, 2021-8-1 作者: Zhongtao Jiang; Yuanzhe Zhang; Zhao Yang; Jun Zhao; Kang Liu Adobe PDF(1280Kb)  |  收藏  |  浏览/下载:14/6  |  提交时间:2024/06/06 |
| Token-level Direct Preference Optimization 会议论文 , Vienna, Austria, 2024/7/21-27 作者: Zeng,Yongcheng; Liu,Guoqing; Ma,Weiyu; Yang,Ning; Zhang,Haifeng; Wang,Jun Adobe PDF(883Kb)  |  收藏  |  浏览/下载:34/12  |  提交时间:2024/06/05 |
| Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems 会议论文 , online, 2022 作者: Qingxu Fu; Tenghai Qiu; Jianqiang Yi; Zhiqiang Pu; Shiguang Wu Adobe PDF(5807Kb)  |  收藏  |  浏览/下载:19/6  |  提交时间:2024/06/05 |
| Improve the efficiency of deep reinforcement learning through semantic exploration guided by natural language. 会议论文 , 北京华腾美居酒店, 2023-12-9 作者: Zhourui Guo; Meng Yao; Yang Yu; Qiyue Yin Adobe PDF(2302Kb)  |  收藏  |  浏览/下载:8/3  |  提交时间:2024/06/03 |
| Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning 会议论文 Advanced Intelligent Computing Technology and Applications, 中国郑州, 2023-8 作者: He SQ(何少钦); Gao Y(高阳); Zhang BF(张保丰); Chang H(常惠); Zhang XC(张鑫辰) Adobe PDF(1496Kb)  |  收藏  |  浏览/下载:24/11  |  提交时间:2024/05/31 Air Combat, Reinforcement Learning, Neural Fictitious Self-Play. |