| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs Conference paper, Australia, 2023-6 Authors: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo Adobe PDF(7948Kb)  |  Views/Downloads: 35/14  |  Submitted: 2024/06/25 Keywords: reinforcement learning, hierarchical reinforcement learning |
| Feature Comparison Based Channel Attention For Fine-Grained Visual Classification Conference paper, Abu Dhabi, United Arab Emirates, 25-28 October 2020 Authors: Shukun Jia; Yan Bai; Zhang Jing Adobe PDF(163Kb)  |  Views/Downloads: 19/9  |  Submitted: 2024/06/21 |
| A Double-Observation Policy Learning Framework for Multi-target Coverage with Connectivity Maintenance Conference paper, online, 2022-2 Authors: Xu YF(徐一凡); Pu ZQ(蒲志强); Wu SG(吴士广); Liu BY(刘博寅); Yi JQ(易建强); Geng HJ(耿虎军); Chai XH(柴兴华) Adobe PDF(9582Kb)  |  Views/Downloads: 22/6  |  Submitted: 2024/06/21 |
| Teaching Small Language Models to Reason for Knowledge-Intensive Multi-Hop Question Answering Conference paper, Bangkok, Thailand, 2024.08.11-2024.08.16 Authors: Xiang Li; Shizhu HE; Fangyu Lei; Jun Yang; Tianhuang Su; Kang Liu; Jun Zhao Adobe PDF(873Kb)  |  Views/Downloads: 41/14  |  Submitted: 2024/06/20 |
| Immersion and Invariance Based Composite Adaptive Control for Nonlinear Systems with Both Parametric and Non-Parametric Uncertainties Conference paper, Berlin, Germany, 2020.7.12-17 Authors: Zhen Liu; Zhiqiang Pu; Tenghai Qiu; Huimu Wang; Jianqiang Yi Adobe PDF(1554Kb)  |  Views/Downloads: 45/17  |  Submitted: 2024/06/20 |
| Multi-Scale Dynamic Coding Improved Spiking Actor Network for Reinforcement Learning Conference paper, Online, February 22–March 1, 2022 Authors: Zhang, Duzhen; Zhang, Tielin; Jia, Shuncheng; Xu, Bo Adobe PDF(2249Kb)  |  Views/Downloads: 38/14  |  Submitted: 2024/06/11 |
| Learning in bi-level Markov games Conference paper, Padua, Italy, 2022.7.18-2022.7.23 Authors: Meng Linghui; Ruan Jingqing; Xing Dengpeng; Xu Bo Adobe PDF(1450Kb)  |  Views/Downloads: 44/18  |  Submitted: 2024/06/11 |
| Alignment Rationale for Natural Language Inference Conference paper, Online, 2021-8-1 Authors: Zhongtao Jiang; Yuanzhe Zhang; Zhao Yang; Jun Zhao; Kang Liu Adobe PDF(1280Kb)  |  Views/Downloads: 42/14  |  Submitted: 2024/06/06 |
| Token-level Direct Preference Optimization Conference paper, Vienna, Austria, 2024/7/21-27 Authors: Zeng, Yongcheng; Liu, Guoqing; Ma, Weiyu; Yang, Ning; Zhang, Haifeng; Wang, Jun Adobe PDF(883Kb)  |  Views/Downloads: 66/22  |  Submitted: 2024/06/05 |
| Learning Superior Cooperative Policy in Competitive Multi-team Reinforcement Learning Conference paper, Gold Coast, Australia, 2023-6 Authors: Qingxu Fu; Tenghai Qiu; Zhiqiang Pu; Jianqiang Yi; Xiaolin Ai; Wanmai Yuan Adobe PDF(25675Kb)  |  Views/Downloads: 44/10  |  Submitted: 2024/06/05 |