CASIA OpenIR

浏览/检索结果: 共30条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Learn to flap: foil non-parametric path planning via deep reinforcement learning 期刊论文
Journal of Fluid Mechanics, 2024, 卷号: 984, 页码: A9
作者:  Wang, Zhipeng;  Lin, Runji;  Zhao, Zhiyu;  Chen, Xu;  Guo, Pengming;  Yang, Ning;  Wang,Zhicheng;  Fan, Dixia
Adobe PDF(1892Kb)  |  收藏  |  浏览/下载:35/6  |  提交时间:2024/06/07
Fuzzy Feedback Multi-Agent Reinforcement Learning for Adversarial Dynamic Multi-Team Competitions 期刊论文
IEEE Transactions on Fuzzy Systems, 2024, 页码: 1
作者:  Qingxu Fu;  Zhiqiang Pu;  Yi Pan;  Tenghai Qiu;  Jianqiang Yi
Adobe PDF(4975Kb)  |  收藏  |  浏览/下载:28/10  |  提交时间:2024/06/05
Enhancing efficiency and propulsion in bio-mimetic robotic fish through end-to-end deep reinforcement learning 期刊论文
Physics of Fluids, 2024, 卷号: 36, 期号: 3, 页码: 031910
作者:  Cui,Xinyu;  Sun,Boai;  Zhu,Yi;  Yang,Ning;  Zhang,Haifeng;  Cui,Weicheng;  Fan,Dixia;  Wang,Jun
Adobe PDF(4056Kb)  |  收藏  |  浏览/下载:50/18  |  提交时间:2024/06/02
bio-mimetic robotic fish  deep reinforcement learning  
Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning 会议论文
Advanced Intelligent Computing Technology and Applications, 中国郑州, 2023-8
作者:  He SQ(何少钦);  Gao Y(高阳);  Zhang BF(张保丰);  Chang H(常惠);  Zhang XC(张鑫辰)
Adobe PDF(1496Kb)  |  收藏  |  浏览/下载:44/13  |  提交时间:2024/05/31
Air Combat, Reinforcement Learning, Neural Fictitious Self-Play.  
稀疏奖励环境下基于自博弈框架的智能空战算法研究 学位论文
, 2024
作者:  何少钦
Adobe PDF(4570Kb)  |  收藏  |  浏览/下载:35/1  |  提交时间:2024/05/30
强化学习,离线强化学习,空战,智能决策,好奇心机制  
T-Agent: A Term-Aware Agent for Medical Dialogue Generation 会议论文
, Yokohama, Japan, 2024-6-30 - 2023-7-5
作者:  Zefa Hu;  Haozhi Zhao;  Yuanyuan Zhao;  Shuang Xu;  Bo Xu
Adobe PDF(483Kb)  |  收藏  |  浏览/下载:37/9  |  提交时间:2024/05/29
Target-Following Control of a Biomimetic Autonomous System Based on Predictive Reinforcement Learning 期刊论文
BIOMIMETICS, 2024, 卷号: 9, 期号: 1, 页码: 19
作者:  Wang, Yu;  Wang, Jian;  Kang, Song;  Yu, Junzhi
Adobe PDF(1553Kb)  |  收藏  |  浏览/下载:67/12  |  提交时间:2024/03/26
biomimetic motion  biomimetic autonomous system  target following  deep reinforcement learning  predictive control  
Hedonic Coalition Formation for Distributed Task Allocation in Heterogeneous Multi-agent System 期刊论文
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2024, 页码: 13
作者:  Wang, Lexing;  Qiu, Tenghai;  Pu, Zhiqiang;  Yi, Jianqiang;  Zhu, Jinying;  Yuan, Wanmai
Adobe PDF(2578Kb)  |  收藏  |  浏览/下载:110/14  |  提交时间:2024/03/13
Coalition formation  hedonic games  heterogeneous agents  Nash stable  task allocation  
A Multi-modal Global Instance Tracking Benchmark (MGIT): Better Locating Target in Complex Spatio-temporal and causal Relationship 会议论文
, New Orleans, 2023-12
作者:  Shiyu, Hu;  Dailing, Zhang;  Meiqi, Wu;  Xiaokun, Feng;  Xuchen, Li;  Xin, Zhao;  Kaiqi, Huang
Adobe PDF(6215Kb)  |  收藏  |  浏览/下载:113/25  |  提交时间:2024/01/22
Large sequence models for sequential decision-making: a survey 期刊论文
FRONTIERS OF COMPUTER SCIENCE, 2023, 卷号: 17, 期号: 6, 页码: 18
作者:  Wen, Muning;  Lin, Runji;  Wang, Hanjing;  Yang, Yaodong;  Wen, Ying;  Mai, Luo;  Wang, Jun;  Zhang, Haifeng;  Zhang, Weinan
Adobe PDF(1351Kb)  |  收藏  |  浏览/下载:142/3  |  提交时间:2023/11/17
sequential decision-making  sequence modeling  the Transformer  training system