CASIA OpenIR

浏览/检索结果: 共62条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
On the Effects of Structural Modeling for Neural Semantic Parsing 会议论文
Proceedings of the 27th Conference on Computational Natural Language Learning (CoNLL), Singapore, Singapore, 2023-12
作者:  Zhang X(张翔);  He SZ(何世柱);  Liu K(刘康);  Zhao J(赵军)
Adobe PDF(730Kb)  |  收藏  |  浏览/下载:29/18  |  提交时间:2024/06/27
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:30/12  |  提交时间:2024/06/25
强化学习,分层强化学习  
Bidirectional Sentence Ordering with Interactive Decoding 期刊论文
ACM Transactions on Asian and Low-Resource Language Information Processing, 2023, 卷号: 22, 期号: 2, 页码: 1-15
作者:  Guirong Bai;  Shizhu HE;  Kang Liu;  Jun Zhao
Adobe PDF(1080Kb)  |  收藏  |  浏览/下载:27/11  |  提交时间:2024/06/20
Power Control Based on Deep Reinforcement Learning for Spectrum Sharing 期刊论文
IEEE Transactions on Wireless Communications, 2024, 卷号: 19, 期号: 6, 页码: 4209-4219
作者:  Zhang,Haijun;  Yang,Ning;  Huangfu,Wei;  Long,Keping;  Leung,VictorCM
Adobe PDF(1925Kb)  |  收藏  |  浏览/下载:38/17  |  提交时间:2024/06/12
Learn to flap: foil non-parametric path planning via deep reinforcement learning 期刊论文
Journal of Fluid Mechanics, 2024, 卷号: 984, 页码: A9
作者:  Wang, Zhipeng;  Lin, Runji;  Zhao, Zhiyu;  Chen, Xu;  Guo, Pengming;  Yang, Ning;  Wang,Zhicheng;  Fan, Dixia
Adobe PDF(1892Kb)  |  收藏  |  浏览/下载:46/11  |  提交时间:2024/06/07
A cooperation and decision-making framework in dynamic confrontation for multi-agent systems 期刊论文
Computers and Electrical Engineering, 2024, 页码: 118
作者:  Lexing Wang;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi
Adobe PDF(1302Kb)  |  收藏  |  浏览/下载:39/12  |  提交时间:2024/06/06
Token-level Direct Preference Optimization 会议论文
, Vienna, Austria, 2024/7/21-27
作者:  Zeng,Yongcheng;  Liu,Guoqing;  Ma,Weiyu;  Yang,Ning;  Zhang,Haifeng;  Wang,Jun
Adobe PDF(883Kb)  |  收藏  |  浏览/下载:63/22  |  提交时间:2024/06/05
Learning Heterogeneous Agent Cooperation via Multiagent League Training 期刊论文
IFAC World Congress, 2023, 页码: IFAC PapersOnLine 56-2 (2023) 3033-3040
作者:  Qingxu, Fu;  Xiaolin Ai;  Jianqiang Yi;  Tenghai Qiu;  Wanmai Yuan;  Zhiqiang Pu
Adobe PDF(996Kb)  |  收藏  |  浏览/下载:36/11  |  提交时间:2024/06/05
Enhancing efficiency and propulsion in bio-mimetic robotic fish through end-to-end deep reinforcement learning 期刊论文
Physics of Fluids, 2024, 卷号: 36, 期号: 3, 页码: 031910
作者:  Cui,Xinyu;  Sun,Boai;  Zhu,Yi;  Yang,Ning;  Zhang,Haifeng;  Cui,Weicheng;  Fan,Dixia;  Wang,Jun
Adobe PDF(4056Kb)  |  收藏  |  浏览/下载:60/23  |  提交时间:2024/06/02
bio-mimetic robotic fish  deep reinforcement learning  
Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning 会议论文
Advanced Intelligent Computing Technology and Applications, 中国郑州, 2023-8
作者:  He SQ(何少钦);  Gao Y(高阳);  Zhang BF(张保丰);  Chang H(常惠);  Zhang XC(张鑫辰)
Adobe PDF(1496Kb)  |  收藏  |  浏览/下载:53/19  |  提交时间:2024/05/31
Air Combat, Reinforcement Learning, Neural Fictitious Self-Play.