CASIA OpenIR

Browse/Search Results: 159 items in total, showing 1-10

Teaching Small Language Models to Reason for Knowledge-Intensive Multi-Hop Question Answering (Conference paper)
Bangkok, Thailand, 2024.08.11-2024.08.16
Authors: Xiang Li; Shizhu HE; Fangyu Lei; Jun Yang; Tianhuang Su; Kang Liu; Jun Zhao
Adobe PDF(873Kb) | Views/Downloads: 9/2 | Submitted: 2024/06/20
Immersion and Invariance Based Composite Adaptive Control for Nonlinear Systems with Both Parametric and Non-Parametric Uncertainties (Conference paper)
Berlin, Germany, 2020.7.12-17
Authors: Zhen Liu; Zhiqiang Pu; Tenghai Qiu; Huimu Wang; Jianqiang Yi
Adobe PDF(1554Kb) | Views/Downloads: 6/2 | Submitted: 2024/06/20
Controller Design and Stability Analysis for Spinning Missile Via Tensor Product (Journal article)
Aerospace Science and Technology, 2022, Page: 107877
Authors: Zhiming Zhou; Zhen Liu; Yi Pan; Jianqiang Yi
Adobe PDF(1047Kb) | Views/Downloads: 5/1 | Submitted: 2024/06/20
Multi-Scale Dynamic Coding Improved Spiking Actor Network for Reinforcement Learning (Conference paper)
Online, February 22–March 1, 2022
Authors: Zhang, Duzhen; Zhang, Tielin; Jia, Shuncheng; Xu, Bo
Adobe PDF(2249Kb) | Views/Downloads: 9/4 | Submitted: 2024/06/11
Learning in Bi-level Markov Games (Conference paper)
Padua, Italy, 2022.7.18-2022.7.23
Authors: Meng Linghui; Ruan Jingqing; Xing Dengpeng; Xu Bo
Adobe PDF(1450Kb) | Views/Downloads: 12/4 | Submitted: 2024/06/11
Alignment Rationale for Natural Language Inference (Conference paper)
Online, 2021-8-1
Authors: Zhongtao Jiang; Yuanzhe Zhang; Zhao Yang; Jun Zhao; Kang Liu
Adobe PDF(1280Kb) | Views/Downloads: 11/5 | Submitted: 2024/06/06
Token-level Direct Preference Optimization (Conference paper)
Vienna, Austria, 2024/7/21-27
Authors: Zeng, Yongcheng; Liu, Guoqing; Ma, Weiyu; Yang, Ning; Zhang, Haifeng; Wang, Jun
Adobe PDF(883Kb) | Views/Downloads: 32/10 | Submitted: 2024/06/05
Learning Superior Cooperative Policy in Competitive Multi-team Reinforcement Learning (Conference paper)
Gold Coast, Australia, 2023-6
Authors: Qingxu Fu; Tenghai Qiu; Zhiqiang Pu; Jianqiang Yi; Xiaolin Ai; Wanmai Yuan
Adobe PDF(25675Kb) | Views/Downloads: 18/3 | Submitted: 2024/06/05
A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning (Conference paper)
Padua, Italy, July 2022
Authors: Qingxu Fu; Tenghai Qiu; Zhiqiang Pu; Jianqiang Yi; Wanmai Yuan
Adobe PDF(2650Kb) | Views/Downloads: 13/3 | Submitted: 2024/06/05
Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems (Conference paper)
Online, 2022
Authors: Qingxu Fu; Tenghai Qiu; Jianqiang Yi; Zhiqiang Pu; Shiguang Wu
Adobe PDF(5807Kb) | Views/Downloads: 17/5 | Submitted: 2024/06/05