已选(0)清除
条数/页: 排序方式: |
| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文 , 澳大利亚, 2023-6 作者: Zhang Qingyang ; Yang Yiming ; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(7948Kb)  |   收藏  |  浏览/下载:19/7  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Feature Comparison Based Channel Attention For Fine-Grained Visual Classification 会议论文 , Abu Dhabi, United Arab Emirates, 25-28 October 2020 作者: Shukun Jia; Yan Bai; Zhang Jing![](/image/person.jpg)
Adobe PDF(163Kb)  |   收藏  |  浏览/下载:15/7  |  提交时间:2024/06/21 |
| A Double-Observation Policy Learning Framework for Multi-target Coverage with Connectivity Maintenance 会议论文 , online, 2022-2 作者: Xu YF(徐一凡) ; Pu ZQ(蒲志强) ; Wu SG(吴士广) ; Liu BY(刘博寅); Yi JQ(易建强) ; Geng HJ(耿虎军); Chai XH(柴兴华)
Adobe PDF(9582Kb)  |   收藏  |  浏览/下载:11/3  |  提交时间:2024/06/21 |
| Teaching Small Language Models to Reason for Knowledge-Intensive Multi-Hop Question Answering 会议论文 , Bangkok, Thailand, 2024.08.11-2024.08.16 作者: Xiang Li ; Shizhu HE ; Fangyu Lei; Jun Yang; Tianhuang Su; Kang Liu ; Jun Zhao![](/image/person.jpg)
Adobe PDF(873Kb)  |   收藏  |  浏览/下载:33/11  |  提交时间:2024/06/20 |
| Immersion and Invariance Based Composite Adaptive Control for Nonlinear Systems with Both Parametric and Non-Parametric Uncertainties 会议论文 , Berlin, Germany, 2020.7.12-17 作者: Zhen Liu ; Zhiqiang Pu ; Tenghai Qiu ; Huimu Wang ; Jianqiang Yi![](/image/person.jpg)
Adobe PDF(1554Kb)  |   收藏  |  浏览/下载:35/11  |  提交时间:2024/06/20 |
| Multi-Scale Dynamic Coding Improved Spiking Actor Network for Reinforcement Learning 会议论文 , Online, February 22–March 1, 2022 作者: Zhang, Duzhen ; Zhang, Tielin ; Jia, Shuncheng; Xu, Bo![](/image/person.jpg)
Adobe PDF(2249Kb)  |   收藏  |  浏览/下载:27/10  |  提交时间:2024/06/11 |
| Learning in bi-level markov games 会议论文 , Padua, Italy, 2022.7.18-2022.7.23 作者: Meng Linghui ; Ruan Jingqing; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(1450Kb)  |   收藏  |  浏览/下载:35/12  |  提交时间:2024/06/11 |
| Alignment Rationale for Natural Language Inference 会议论文 , Online, 2021-8-1 作者: Zhongtao Jiang ; Yuanzhe Zhang ; Zhao Yang; Jun Zhao ; Kang Liu
Adobe PDF(1280Kb)  |   收藏  |  浏览/下载:31/12  |  提交时间:2024/06/06 |
| Token-level Direct Preference Optimization 会议论文 , Vienna, Austria, 2024/7/21-27 作者: Zeng,Yongcheng; Liu,Guoqing ; Ma,Weiyu; Yang,Ning ; Zhang,Haifeng; Wang,Jun
Adobe PDF(883Kb)  |   收藏  |  浏览/下载:53/17  |  提交时间:2024/06/05 |
| Learning Superior Cooperative Policy in Competitive Multi-team Reinforcement Learning 会议论文 , Gold Coast, Australia, 2023-6 作者: Qingxu Fu ; Tenghai Qiu ; Zhiqiang Pu ; Jianqiang Yi ; Xiaolin Ai ; Wanmai Yuan
Adobe PDF(25675Kb)  |   收藏  |  浏览/下载:34/5  |  提交时间:2024/06/05 |