已选(0)清除
条数/页: 排序方式: |
| Teaching Small Language Models to Reason for Knowledge-Intensive Multi-Hop Question Answering 会议论文 , Bangkok, Thailand, 2024.08.11-2024.08.16 作者: Xiang Li; Shizhu HE; Fangyu Lei; Jun Yang; Tianhuang Su; Kang Liu; Jun Zhao Adobe PDF(873Kb)  |  收藏  |  浏览/下载:9/2  |  提交时间:2024/06/20 |
| Immersion and Invariance Based Composite Adaptive Control for Nonlinear Systems with Both Parametric and Non-Parametric Uncertainties 会议论文 , Berlin, Germany, 2020.7.12-17 作者: Zhen Liu; Zhiqiang Pu; Tenghai Qiu; Huimu Wang; Jianqiang Yi Adobe PDF(1554Kb)  |  收藏  |  浏览/下载:6/2  |  提交时间:2024/06/20 |
| Controller Design and Stability Analysis for Spinning Missile Via Tensor Product 期刊论文 Aerospace Science and Technology, 2022, 页码: 107877 作者: Zhiming Zhou; Zhen Liu; Yi Pan; Jianqiang Yi Adobe PDF(1047Kb)  |  收藏  |  浏览/下载:5/1  |  提交时间:2024/06/20 |
| Multi-Scale Dynamic Coding Improved Spiking Actor Network for Reinforcement Learning 会议论文 , Online, February 22–March 1, 2022 作者: Zhang, Duzhen; Zhang, Tielin; Jia, Shuncheng; Xu, Bo Adobe PDF(2249Kb)  |  收藏  |  浏览/下载:9/4  |  提交时间:2024/06/11 |
| Learning in bi-level markov games 会议论文 , Padua, Italy, 2022.7.18-2022.7.23 作者: Meng Linghui; Ruan Jingqing; Xing Dengpeng; Xu Bo Adobe PDF(1450Kb)  |  收藏  |  浏览/下载:12/4  |  提交时间:2024/06/11 |
| Alignment Rationale for Natural Language Inference 会议论文 , Online, 2021-8-1 作者: Zhongtao Jiang; Yuanzhe Zhang; Zhao Yang; Jun Zhao; Kang Liu Adobe PDF(1280Kb)  |  收藏  |  浏览/下载:11/5  |  提交时间:2024/06/06 |
| Token-level Direct Preference Optimization 会议论文 , Vienna, Austria, 2024/7/21-27 作者: Zeng,Yongcheng; Liu,Guoqing; Ma,Weiyu; Yang,Ning; Zhang,Haifeng; Wang,Jun Adobe PDF(883Kb)  |  收藏  |  浏览/下载:32/10  |  提交时间:2024/06/05 |
| Learning Superior Cooperative Policy in Competitive Multi-team Reinforcement Learning 会议论文 , Gold Coast, Australia, 2023-6 作者: Qingxu Fu; Tenghai Qiu; Zhiqiang Pu; Jianqiang Yi; Xiaolin Ai; Wanmai Yuan Adobe PDF(25675Kb)  |  收藏  |  浏览/下载:18/3  |  提交时间:2024/06/05 |
| A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning 会议论文 , Padua, Italy, 2022年07月 作者: Qingxu Fu; Tenghai Qiu; Zhiqiang Pu; Jianqiang Yi; Wanmai Yuan Adobe PDF(2650Kb)  |  收藏  |  浏览/下载:13/3  |  提交时间:2024/06/05 |
| Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems 会议论文 , online, 2022 作者: Qingxu Fu; Tenghai Qiu; Jianqiang Yi; Zhiqiang Pu; Shiguang Wu Adobe PDF(5807Kb)  |  收藏  |  浏览/下载:17/5  |  提交时间:2024/06/05 |