CASIA OpenIR

浏览/检索结果: 共23条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
On the Effects of Structural Modeling for Neural Semantic Parsing 会议论文
Proceedings of the 27th Conference on Computational Natural Language Learning (CoNLL), Singapore, Singapore, 2023-12
作者:  Zhang X(张翔);  He SZ(何世柱);  Liu K(刘康);  Zhao J(赵军)
Adobe PDF(730Kb)  |  收藏  |  浏览/下载:11/7  |  提交时间:2024/06/27
Interpreting Sentiment Composition with Latent Semantic Tree 会议论文
, Toronto, Canada, 2023-7-9
作者:  Zhongtao Jiang;  Yuanzhe Zhang;  Cao Liu;  Jiansong Chen;  Jun Zhao;  Kang Liu
Adobe PDF(509Kb)  |  收藏  |  浏览/下载:31/15  |  提交时间:2024/06/06
Token-level Direct Preference Optimization 会议论文
, Vienna, Austria, 2024/7/21-27
作者:  Zeng,Yongcheng;  Liu,Guoqing;  Ma,Weiyu;  Yang,Ning;  Zhang,Haifeng;  Wang,Jun
Adobe PDF(883Kb)  |  收藏  |  浏览/下载:42/13  |  提交时间:2024/06/05
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning 会议论文
, New Orleans, LA, USA,, November 28 - December 9, 2022
作者:  Zhiwei Xu;  Dapeng Li;  Bin Zhang;  Yuan Zhan;  Yunpeng Bai;  Guoliang Fan
Adobe PDF(4367Kb)  |  收藏  |  浏览/下载:23/4  |  提交时间:2024/05/28
MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Shenzhen, China, 18-22 July 2021
作者:  Zhiwei Xu;  Dapeng Li;  Yunpeng Bai;  Guoliang Fan
Adobe PDF(3892Kb)  |  收藏  |  浏览/下载:13/7  |  提交时间:2024/05/28
Learning Cooperative Policies with Graph Networks in Distributed Swarm Systems 会议论文
, Queensland, Australia, June 18-23, 2023
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强);  Ai XL(艾晓琳);  Yuan GM(袁莞迈)
Adobe PDF(612Kb)  |  收藏  |  浏览/下载:177/52  |  提交时间:2023/06/12
Hierarchical Cooperative Swarm Policy Learning with Role Emergence 会议论文
, Online, 05-07 December 2021
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Qiu TH(丘腾海);  Yi JQ(易建强)
Adobe PDF(327Kb)  |  收藏  |  浏览/下载:150/63  |  提交时间:2023/06/12
Intrinsic Reward with Peer Incentives for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Online, 18-23 July 2022
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Wu SG(吴士广);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(2189Kb)  |  收藏  |  浏览/下载:214/59  |  提交时间:2023/06/12
Multi-UAV Cooperative Short-Range Combat via Attention-Based Reinforcement Learning using Individual Reward Shaping 会议论文
, Kyoto, Japan, October 23-27, 2022
作者:  Zhang TL(张天乐);  Qiu TH(丘腾海);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(896Kb)  |  收藏  |  浏览/下载:159/50  |  提交时间:2023/06/12
Multi-Target Encirclement with Collision Avoidance via Deep Reinforcement Learning using Relational Graphs 会议论文
, Philadelphia, PA, USA, May 23-27, 2022
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(4277Kb)  |  收藏  |  浏览/下载:159/36  |  提交时间:2023/06/12