CASIA OpenIR

浏览/检索结果: 共401条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:10/4  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
Discrete Vortex Method-Based Fish-Like Locomotion Modeling 期刊论文
IEEE JOURNAL OF OCEANIC ENGINEERING, 2024, 页码: 13
作者:  Zou, Qianqian;  Zhou, Chao;  Zhu, Chunhui;  Zhang, Zhuoliang;  Fan, Junfeng
收藏  |  浏览/下载:0/0  |  提交时间:2024/07/03
Mathematical models  Hydrodynamics  Robot kinematics  Tail  Analytical models  Shape  Kinematics  Discrete vortex method (DVM)  dynamic modeling  robotic fish  
AdaNSP: Uncertainty-driven Adaptive Decoding in Neural Semantic Parsing 会议论文
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy, 2019-07
作者:  Zhang X(张翔);  He SZ(何世柱);  Liu K(刘康);  Zhao J(赵军)
Adobe PDF(400Kb)  |  收藏  |  浏览/下载:20/6  |  提交时间:2024/06/26
Online Optimization of Normalized CPGs for a Multi-Joint Robotic Fish 会议论文
, 中国,上海, 2021年7月
作者:  Tong R(仝茹);  Wu ZX(吴正兴);  Wang J(王健);  Tan M(谭民);  Yu JZ(喻俊志)
Adobe PDF(456Kb)  |  收藏  |  浏览/下载:12/6  |  提交时间:2024/06/26
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:10/5  |  提交时间:2024/06/25
Hitch-Hiking Motion of Multiple Bionic Robotic Remoras with Enhanced Multimodal Locomotion 期刊论文
IEEE-ASME Transactions on Mechatronics, 2024, 页码: 1-11
作者:  Wu, Zhengxing;  Yu, Lianyi;  Wang, Jian;  Dai, Shijie;  Tan, Min;  Yu, Junzhi
Adobe PDF(4893Kb)  |  收藏  |  浏览/下载:25/9  |  提交时间:2024/06/24
Improving Generalization of Multi-agent Reinforcement Learning through Domain-Invariant Feature Extraction 会议论文
, Greece, 2023-5
作者:  Xu YF(徐一凡);  Pu ZQ(蒲志强);  Cai QA(蔡奇昂);  Li FM(李非墨);  Chai XH(柴兴华)
Adobe PDF(7610Kb)  |  收藏  |  浏览/下载:8/4  |  提交时间:2024/06/21
A Double-Observation Policy Learning Framework for Multi-target Coverage with Connectivity Maintenance 会议论文
, online, 2022-2
作者:  Xu YF(徐一凡);  Pu ZQ(蒲志强);  Wu SG(吴士广);  Liu BY(刘博寅);  Yi JQ(易建强);  Geng HJ(耿虎军);  Chai XH(柴兴华)
Adobe PDF(9582Kb)  |  收藏  |  浏览/下载:8/2  |  提交时间:2024/06/21
MoDE-CoTD: Chain-of-Thought Distillation for Complex Reasoning Tasks with Mixture of Decoupled LoRA-Experts 会议论文
, Torino (Italia), 2024.5.20 - 2024.5.25
作者:  Xiang Li;  Shizhu He;  Jiayu Wu;  Zhao Yang;  Yao Xu;  Yang Jun;  Haifeng Liu;  Kang Liu;  Jun Zhao
Adobe PDF(1062Kb)  |  收藏  |  浏览/下载:18/4  |  提交时间:2024/06/20
Power Control Based on Deep Reinforcement Learning for Spectrum Sharing 期刊论文
IEEE Transactions on Wireless Communications, 2024, 卷号: 19, 期号: 6, 页码: 4209-4219
作者:  Zhang,Haijun;  Yang,Ning;  Huangfu,Wei;  Long,Keping;  Leung,VictorCM
Adobe PDF(1925Kb)  |  收藏  |  浏览/下载:25/11  |  提交时间:2024/06/12