CASIA OpenIR

Browse/Search results: 177 items in total, showing items 1-10

On the Effects of Structural Modeling for Neural Semantic Parsing (Conference paper)
Proceedings of the 27th Conference on Computational Natural Language Learning (CoNLL), Singapore, Singapore, 2023-12
Authors:  Zhang X(张翔);  He SZ(何世柱);  Liu K(刘康);  Zhao J(赵军)
Adobe PDF(730Kb)  |  Views/Downloads: 33/19  |  Submitted: 2024/06/27
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs (Conference paper)
, Australia, 2023-6
Authors:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  Views/Downloads: 35/14  |  Submitted: 2024/06/25
Reinforcement learning, hierarchical reinforcement learning
Learning to Align Question and Answer Utterances in Customer Service Conversation with Recurrent Pointer Networks (Conference paper)
, Honolulu, Hawaii, USA, 2019.01.27 - 2019.02.01
Authors:  Shizhu HE;  Kang Liu;  Weiting An
Adobe PDF(1562Kb)  |  Views/Downloads: 47/18  |  Submitted: 2024/06/20
Power Control Based on Deep Reinforcement Learning for Spectrum Sharing (Journal article)
IEEE Transactions on Wireless Communications, 2024, Volume: 19, Issue: 6, Pages: 4209-4219
Authors:  Zhang,Haijun;  Yang,Ning;  Huangfu,Wei;  Long,Keping;  Leung,VictorCM
Adobe PDF(1925Kb)  |  Views/Downloads: 42/17  |  Submitted: 2024/06/12
Learn to flap: foil non-parametric path planning via deep reinforcement learning (Journal article)
Journal of Fluid Mechanics, 2024, Volume: 984, Pages: A9
Authors:  Wang, Zhipeng;  Lin, Runji;  Zhao, Zhiyu;  Chen, Xu;  Guo, Pengming;  Yang, Ning;  Wang,Zhicheng;  Fan, Dixia
Adobe PDF(1892Kb)  |  Views/Downloads: 49/11  |  Submitted: 2024/06/07
Alignment Rationale for Natural Language Inference (Conference paper)
, Online, 2021-8-1
Authors:  Zhongtao Jiang;  Yuanzhe Zhang;  Zhao Yang;  Jun Zhao;  Kang Liu
Adobe PDF(1280Kb)  |  Views/Downloads: 41/14  |  Submitted: 2024/06/06
A cooperation and decision-making framework in dynamic confrontation for multi-agent systems (Journal article)
Computers and Electrical Engineering, 2024, Volume: 118, Pages: 118
Authors:  Lexing Wang;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi
Adobe PDF(1302Kb)  |  Views/Downloads: 44/13  |  Submitted: 2024/06/06
Multi-agent system  Target allocation  Decision making  Swarm motion control  
A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning (Conference paper)
, Padua, Italy, 2022-07
Authors:  Qingxu Fu;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi;  Wanmai Yuan
Adobe PDF(2650Kb)  |  Views/Downloads: 40/12  |  Submitted: 2024/06/05
Continuous Exploration via Multiple Perspectives in Sparse Reward Environment (Conference paper)
, Xiamen International Conference Center, 2023-10-13
Authors:  Chen ZP(陈忠鹏);  Guan Q(关强)
Adobe PDF(2260Kb)  |  Views/Downloads: 36/11  |  Submitted: 2024/06/04
Reinforcement Learning · Exploration Strategy · Sparse Reward · Intrinsic Motivation  
Explanation Guided Knowledge Distillation for Pre-trained Language Model Compression (Journal article)
ACM Transactions on Asian and Low-Resource Language Information Processing, 2024, Volume: 23, Issue: 2, Pages: 1-19
Authors:  Zhao Yang;  Yuanzhe Zhang;  Dianbo Sui;  Yiming Ju;  Jun Zhao;  Kang Liu
Adobe PDF(1250Kb)  |  Views/Downloads: 57/21  |  Submitted: 2024/05/30
Explanation  knowledge distillation  model compression