CASIA OpenIR

Browse/Search results: 5 items in total, showing items 1-5

Learn to flap: foil non-parametric path planning via deep reinforcement learning [Journal article]
Journal of Fluid Mechanics, 2024, Volume: 984, Pages: A9
Authors: Wang, Zhipeng; Lin, Runji; Zhao, Zhiyu; Chen, Xu; Guo, Pengming; Yang, Ning; Wang, Zhicheng; Fan, Dixia
Adobe PDF (1892Kb) | Views/Downloads: 28/4 | Submitted: 2024/06/07
A cooperation and decision-making framework in dynamic confrontation for multi-agent systems [Journal article]
Computers and Electrical Engineering, 2024, Pages: 118
Authors: Lexing Wang; Tenghai Qiu; Zhiqiang Pu; Jianqiang Yi
Adobe PDF (1302Kb) | Views/Downloads: 22/5 | Submitted: 2024/06/06
Token-level Direct Preference Optimization [Conference paper]
Vienna, Austria, 2024/7/21-27
Authors: Zeng, Yongcheng; Liu, Guoqing; Ma, Weiyu; Yang, Ning; Zhang, Haifeng; Wang, Jun
Adobe PDF (883Kb) | Views/Downloads: 42/13 | Submitted: 2024/06/05
Enhancing efficiency and propulsion in bio-mimetic robotic fish through end-to-end deep reinforcement learning [Journal article]
Physics of Fluids, 2024, Volume: 36, Issue: 3, Pages: 031910
Authors: Cui, Xinyu; Sun, Boai; Zhu, Yi; Yang, Ning; Zhang, Haifeng; Cui, Weicheng; Fan, Dixia; Wang, Jun
Adobe PDF (4056Kb) | Views/Downloads: 44/15 | Submitted: 2024/06/02
Keywords: bio-mimetic robotic fish; deep reinforcement learning
Contrastive Correlation Preserving Replay for Online Continual Learning [Journal article]
IEEE Transactions on Circuits and Systems for Video Technology, 2024, Volume: 34, Issue: 1, Pages: 124-139
Authors: Yu, Da; Zhang, Mingyi; Li, Mantian; Zha, Fusheng; Zhang, Junge; Sun, Lining; Huang, Kaiqi
Views/Downloads: 52/0 | Submitted: 2024/03/26
Keywords: Task analysis; Correlation; Knowledge transfer; Training; Memory management; Data models; Mutual information; Continual learning; catastrophic forgetting; class-incremental learning; experience replay