CASIA OpenIR

浏览/检索结果: 共87条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
AdaNSP: Uncertainty-driven Adaptive Decoding in Neural Semantic Parsing 会议论文
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy, 2019-07
作者:  Zhang X(张翔);  He SZ(何世柱);  Liu K(刘康);  Zhao J(赵军)
Adobe PDF(400Kb)  |  收藏  |  浏览/下载:28/9  |  提交时间:2024/06/26
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:19/9  |  提交时间:2024/06/25
Power Control Based on Deep Reinforcement Learning for Spectrum Sharing 期刊论文
IEEE Transactions on Wireless Communications, 2024, 卷号: 19, 期号: 6, 页码: 4209-4219
作者:  Zhang,Haijun;  Yang,Ning;  Huangfu,Wei;  Long,Keping;  Leung,VictorCM
Adobe PDF(1925Kb)  |  收藏  |  浏览/下载:40/17  |  提交时间:2024/06/12
Multi-Scale Dynamic Coding Improved Spiking Actor Network for Reinforcement Learning 会议论文
, Online, February 22–March 1, 2022
作者:  Zhang, Duzhen;  Zhang, Tielin;  Jia, Shuncheng;  Xu, Bo
Adobe PDF(2249Kb)  |  收藏  |  浏览/下载:36/14  |  提交时间:2024/06/11
Learn to flap: foil non-parametric path planning via deep reinforcement learning 期刊论文
Journal of Fluid Mechanics, 2024, 卷号: 984, 页码: A9
作者:  Wang, Zhipeng;  Lin, Runji;  Zhao, Zhiyu;  Chen, Xu;  Guo, Pengming;  Yang, Ning;  Wang,Zhicheng;  Fan, Dixia
Adobe PDF(1892Kb)  |  收藏  |  浏览/下载:47/11  |  提交时间:2024/06/07
Generative Calibration for In-context Learning 会议论文
, Singapore, 2023-10-6
作者:  Zhongtao Jiang;  Yuanzhe Zhang;  Cao Liu;  Jun Zhao;  Kang Liu
Adobe PDF(763Kb)  |  收藏  |  浏览/下载:37/16  |  提交时间:2024/06/06
Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning 会议论文
Advanced Intelligent Computing Technology and Applications, 中国郑州, 2023-8
作者:  He SQ(何少钦);  Gao Y(高阳);  Zhang BF(张保丰);  Chang H(常惠);  Zhang XC(张鑫辰)
Adobe PDF(1496Kb)  |  收藏  |  浏览/下载:57/21  |  提交时间:2024/05/31
Air Combat, Reinforcement Learning, Neural Fictitious Self-Play.  
稀疏奖励环境下基于自博弈框架的智能空战算法研究 学位论文
, 2024
作者:  何少钦
Adobe PDF(4570Kb)  |  收藏  |  浏览/下载:46/1  |  提交时间:2024/05/30
强化学习,离线强化学习,空战,智能决策,好奇心机制  
Toward the Intelligent, Safe Exploration of a Biomimetic Underwater Robot: Modeling, Planning, and Control 期刊论文
Biomimetics, 2024, 期号: 9, 页码: 126
作者:  Wang, Yu;  Wang, Jian;  Yu Lianyi;  Kong Shihan;  Yu Junzhi
Adobe PDF(1171Kb)  |  收藏  |  浏览/下载:53/17  |  提交时间:2024/05/30
Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning 会议论文
, Queensland, Australia, 2023-6
作者:  Yang, Chen;  Yang, Guangkai;  Chen, Hao;  Zhang, Junge
Adobe PDF(3027Kb)  |  收藏  |  浏览/下载:58/22  |  提交时间:2024/05/29