已选(0)清除
条数/页: 排序方式: |
| On the Effects of Structural Modeling for Neural Semantic Parsing 会议论文 Proceedings of the 27th Conference on Computational Natural Language Learning (CoNLL), Singapore, Singapore, 2023-12 作者: Zhang X(张翔); He SZ(何世柱); Liu K(刘康); Zhao J(赵军) Adobe PDF(730Kb)  |  收藏  |  浏览/下载:12/7  |  提交时间:2024/06/27 |
| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文 , 澳大利亚, 2023-6 作者: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:14/5  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文 Machine Intelligence Research, 2023, 页码: 158 作者: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:10/5  |  提交时间:2024/06/25 |
| Zero-Shot Cross-Lingual Document-Level Event Causality Identification with Heterogeneous Graph Contrastive Transfer Learning 会议论文 , Torino, Italia, 2024-5 作者: Zhitao He; Pengfei Cao; Zhuoran Jin; Yubo Chen; Kang Liu; Jun Zhao Adobe PDF(794Kb)  |  收藏  |  浏览/下载:10/5  |  提交时间:2024/06/25 |
| LEGO: A Multi-agent Collaborative Framework with Role-playing and Iterative Feedback for Causality Explanation Generation 会议论文 , Singapore, 2023-12 作者: Zhitao He; Pengfei Cao; Yubo Chen; Kang Liu; Jun Zhao Adobe PDF(1153Kb)  |  收藏  |  浏览/下载:6/2  |  提交时间:2024/06/25 |
| MoDE-CoTD: Chain-of-Thought Distillation for Complex Reasoning Tasks with Mixture of Decoupled LoRA-Experts 会议论文 , Torino (Italia), 2024.5.20 - 2024.5.25 作者: Xiang Li; Shizhu He; Jiayu Wu; Zhao Yang; Yao Xu; Yang Jun; Haifeng Liu; Kang Liu; Jun Zhao Adobe PDF(1062Kb)  |  收藏  |  浏览/下载:16/4  |  提交时间:2024/06/20 |
| Learn to flap: foil non-parametric path planning via deep reinforcement learning 期刊论文 Journal of Fluid Mechanics, 2024, 卷号: 984, 页码: A9 作者: Wang, Zhipeng; Lin, Runji; Zhao, Zhiyu; Chen, Xu; Guo, Pengming; Yang, Ning; Wang,Zhicheng; Fan, Dixia Adobe PDF(1892Kb)  |  收藏  |  浏览/下载:31/5  |  提交时间:2024/06/07 |
| Interpreting Sentiment Composition with Latent Semantic Tree 会议论文 , Toronto, Canada, 2023-7-9 作者: Zhongtao Jiang; Yuanzhe Zhang; Cao Liu; Jiansong Chen; Jun Zhao; Kang Liu Adobe PDF(509Kb)  |  收藏  |  浏览/下载:35/16  |  提交时间:2024/06/06 |
| Token-level Direct Preference Optimization 会议论文 , Vienna, Austria, 2024/7/21-27 作者: Zeng,Yongcheng; Liu,Guoqing; Ma,Weiyu; Yang,Ning; Zhang,Haifeng; Wang,Jun Adobe PDF(883Kb)  |  收藏  |  浏览/下载:47/14  |  提交时间:2024/06/05 |
| Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems 会议论文 , online, 2022 作者: Qingxu Fu; Tenghai Qiu; Jianqiang Yi; Zhiqiang Pu; Shiguang Wu Adobe PDF(5807Kb)  |  收藏  |  浏览/下载:25/7  |  提交时间:2024/06/05 |