| Offline Hierarchical Reinforcement Learning: Enable Large-Scale Training in HRL. Conference paper, Nanjing, 2023-11-27. Authors: Yuqiao Wu; Haifeng Zhang; Jun Wang
Adobe PDF (1339 KB)  |  Views/Downloads: 33/10  |  Submitted: 2024/07/12 |
| Learning State-Specific Action Masks for Reinforcement Learning. Journal article, Algorithms, 2024, Volume: 17, Issue: 2, Pages: 60. Authors: Wang ZY(王梓薏); Li XR(李欣然); Sun LY(孙罗洋); Zhang HF(张海峰); Liu HL(刘华林); Jun Wang
Adobe PDF (2976 KB)  |  Views/Downloads: 52/23  |  Submitted: 2024/07/05  |  Keywords: reinforcement learning; exploration efficiency; space reduction |
| UNSUPERVISED LEARNING OF NEURAL SEMANTIC MAPPINGS WITH THE HUNGARIAN ALGORITHM FOR COMPOSITIONAL SEMANTICS. Conference paper, Seoul, South Korea, 2024-04. Authors: Zhang X(张翔); He SZ(何世柱); Liu K(刘康); Zhao J(赵军)
Adobe PDF (294 KB)  |  Views/Downloads: 58/26  |  Submitted: 2024/06/27 |
| On the Effects of Structural Modeling for Neural Semantic Parsing. Conference paper, Proceedings of the 27th Conference on Computational Natural Language Learning (CoNLL), Singapore, Singapore, 2023-12. Authors: Zhang X(张翔); He SZ(何世柱); Liu K(刘康); Zhao J(赵军)
Adobe PDF (730 KB)  |  Views/Downloads: 45/23  |  Submitted: 2024/06/27 |
| AdaNSP: Uncertainty-driven Adaptive Decoding in Neural Semantic Parsing. Conference paper, Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy, 2019-07. Authors: Zhang X(张翔); He SZ(何世柱); Liu K(刘康); Zhao J(赵军)
Adobe PDF (400 KB)  |  Views/Downloads: 40/11  |  Submitted: 2024/06/26 |
| Generative Calibration for In-context Learning. Conference paper, Singapore, 2023-10-6. Authors: Zhongtao Jiang; Yuanzhe Zhang; Cao Liu; Jun Zhao; Kang Liu
Adobe PDF (763 KB)  |  Views/Downloads: 51/22  |  Submitted: 2024/06/06 |
| Interpreting Sentiment Composition with Latent Semantic Tree. Conference paper, Toronto, Canada, 2023-7-9. Authors: Zhongtao Jiang; Yuanzhe Zhang; Cao Liu; Jiansong Chen; Jun Zhao; Kang Liu
Adobe PDF (509 KB)  |  Views/Downloads: 57/22  |  Submitted: 2024/06/06 |
| Alignment Rationale for Natural Language Inference. Conference paper, Online, 2021-8-1. Authors: Zhongtao Jiang; Yuanzhe Zhang; Zhao Yang; Jun Zhao; Kang Liu
Adobe PDF (1280 KB)  |  Views/Downloads: 55/20  |  Submitted: 2024/06/06 |
| Token-level Direct Preference Optimization. Conference paper, Vienna, Austria, 2024/7/21-27. Authors: Zeng, Yongcheng; Liu, Guoqing; Ma, Weiyu; Yang, Ning; Zhang, Haifeng; Wang, Jun
Adobe PDF (883 KB)  |  Views/Downloads: 82/27  |  Submitted: 2024/06/05 |
| Joint caching and transmission in the mobile edge network: An multi-agent learning approach. Conference paper, Madrid, Spain, 2021-12-7. Authors: Mi, Qirui; Yang, Ning; Zhang, Haifeng; Zhang, Haijun; Wang, Jun
Adobe PDF (1724 KB)  |  Views/Downloads: 53/14  |  Submitted: 2024/06/05 |