已选(0)清除
条数/页: 排序方式: |
| MoDE-CoTD: Chain-of-Thought Distillation for Complex Reasoning Tasks with Mixture of Decoupled LoRA-Experts 会议论文 , Torino (Italia), 2024.5.20 - 2024.5.25 作者: Xiang Li ; Shizhu He ; Jiayu Wu; Zhao Yang; Yao Xu; Yang Jun; Haifeng Liu; Kang Liu; Jun Zhao![](/image/person.jpg)
Adobe PDF(1062Kb)  |   收藏  |  浏览/下载:3/1  |  提交时间:2024/06/20 |
| Teaching Small Language Models to Reason for Knowledge-Intensive Multi-Hop Question Answering 会议论文 , Bangkok, Thailand, 2024.08.11-2024.08.16 作者: Xiang Li ; Shizhu HE ; Fangyu Lei; Jun Yang; Tianhuang Su; Kang Liu ; Jun Zhao![](/image/person.jpg)
Adobe PDF(873Kb)  |   收藏  |  浏览/下载:9/2  |  提交时间:2024/06/20 |
| Shifted Chunk Encoder for Transformer Based Streaming End-to-End ASR 会议论文 , Indore,India, 2022.11.28 作者: Wang FY(王方圆) ; Xu B(徐波)![](/image/person.jpg)
Adobe PDF(1374Kb)  |   收藏  |  浏览/下载:12/2  |  提交时间:2024/06/13 |
| Power Control Based on Deep Reinforcement Learning for Spectrum Sharing 期刊论文 IEEE Transactions on Wireless Communications, 2024, 卷号: 19, 期号: 6, 页码: 4209-4219 作者: Zhang,Haijun; Yang,Ning ; Huangfu,Wei; Long,Keping; Leung,VictorCM
Adobe PDF(1925Kb)  |   收藏  |  浏览/下载:12/7  |  提交时间:2024/06/12 |
| M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文 , London, United Kingdom, 2023.5.29-2023.6.2 作者: Meng Linghui ; Ruan Jingqing; Xiong Xuantang; Li Xiyun ; Zhang Xi ; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(1302Kb)  |   收藏  |  浏览/下载:6/2  |  提交时间:2024/06/11 |
| Filtered Observations for Model-Based Multi-agent Reinforcement Learning 会议论文 , Turin, Italy, 2023.9.18-2023.9.22 作者: Meng Linghui ; Xiong Xuantang; Zang Yifan; Zhang Xi ; Li Guoqi ; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(841Kb)  |   收藏  |  浏览/下载:11/5  |  提交时间:2024/06/11 |
| NA-CPG: A robust and stable rhythm generator for robot motion control 期刊论文 Biomimetic Intelligence and Robotics, 2022, 页码: 100075 作者: Tong Ru ; Qiu Changlin ; Wu Zhengxing ; Wang Jian; Tan Min ; Yu Junzhi![](/image/person.jpg)
Adobe PDF(1446Kb)  |   收藏  |  浏览/下载:7/1  |  提交时间:2024/06/06 |
| Alignment Rationale for Natural Language Inference 会议论文 , Online, 2021-8-1 作者: Zhongtao Jiang ; Yuanzhe Zhang ; Zhao Yang; Jun Zhao ; Kang Liu
Adobe PDF(1280Kb)  |   收藏  |  浏览/下载:11/5  |  提交时间:2024/06/06 |
| A cooperation and decision-making framework in dynamic confrontation for multi-agent systems 期刊论文 Computers and Electrical Engineering, 2024, 页码: 118 作者: Lexing Wang; Tenghai Qiu ; Zhiqiang Pu ; Jianqiang Yi![](/image/person.jpg)
Adobe PDF(1302Kb)  |   收藏  |  浏览/下载:17/4  |  提交时间:2024/06/06 |
| Token-level Direct Preference Optimization 会议论文 , Vienna, Austria, 2024/7/21-27 作者: Zeng,Yongcheng; Liu,Guoqing ; Ma,Weiyu; Yang,Ning ; Zhang,Haifeng; Wang,Jun
Adobe PDF(883Kb)  |   收藏  |  浏览/下载:32/10  |  提交时间:2024/06/05 |