CASIA OpenIR

浏览/检索结果: 共48条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Offline Hierarchical Reinforcement Learning: Enable Large-Scale Training in HRL 会议论文
, Nanjing, 2023-11-27
作者:  Yuqiao Wu;  Haifeng Zhang;  Jun Wang
Adobe PDF(1339Kb)  |  收藏  |  浏览/下载:32/10  |  提交时间:2024/07/12
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:49/22  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:29/13  |  提交时间:2024/06/25
A Survey of Recent Advances in Commonsense Knowledge Acquisition: Methods and Resources 期刊论文
Machine Intelligence Research, 2024, 页码: 1
作者:  Wang, Chenhao;  Li, Jiachun;  Chen, Yubo;  Liu, Kang;  Zhao, Jun
Adobe PDF(1228Kb)  |  收藏  |  浏览/下载:31/8  |  提交时间:2024/06/25
Robust Single-particle Cryo-EM Image Denoising and Restoration 会议论文
, Seoul, Korea,, 14-19 April 2024
作者:  Zhang Jing;  Tengfei Zhao;  ShiYu Hu;  Xin Zhao
Adobe PDF(966Kb)  |  收藏  |  浏览/下载:53/15  |  提交时间:2024/06/21
Multi-Scale Dynamic Coding Improved Spiking Actor Network for Reinforcement Learning 会议论文
, Online, February 22–March 1, 2022
作者:  Zhang, Duzhen;  Zhang, Tielin;  Jia, Shuncheng;  Xu, Bo
Adobe PDF(2249Kb)  |  收藏  |  浏览/下载:48/16  |  提交时间:2024/06/11
Representative Demonstration Selection for In-Context Learning with Two-Stage Determinantal Point Process 会议论文
, Singapore, 2023-12
作者:  Zhao Yang;  Yuanzhe Zhang;  Dianbo Sui;  Cao Liu;  Jun Zhao;  Kang Liu
Adobe PDF(592Kb)  |  收藏  |  浏览/下载:62/27  |  提交时间:2024/05/30
Unsupervised Dialogue State Tracking for End-to-End Task-Oriented Dialogue with a Multi-Span Prediction Network 期刊论文
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, 卷号: 38, 期号: 4, 页码: 834-852
作者:  Liu, Qing-Bin;  He, Shi-Zhu;  Liu, Cao;  Liu, Kang;  Zhao, Jun
收藏  |  浏览/下载:67/0  |  提交时间:2024/02/22
end-to-end task-oriented dialogue  dialogue state tracking (DST)  unsupervised learning  reinforcement learning  
Counterfactual Supporting Facts Extraction for Explainable Medical Record Based Diagnosis with Graph Network 会议论文
, Online, June 6–11, 2021
作者:  Wu HR(吴浩然);  Chen W(陈炜);  Xu S(徐爽);  Xu B(徐波)
Adobe PDF(1394Kb)  |  收藏  |  浏览/下载:230/69  |  提交时间:2023/06/26
Joint Modeling of Document and Label with Clause Interaction Hypergraph for ICD Medical Code Assignment 会议论文
, Padua, Italy, 18-23 July 2022
作者:  Wu HR(吴浩然);  Meng LH(孟令辉);  Xu S(徐爽);  Xu B(徐波)
Adobe PDF(612Kb)  |  收藏  |  浏览/下载:114/46  |  提交时间:2023/06/26