CASIA OpenIR

浏览/检索结果: 共144条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:35/14  |  提交时间:2024/06/25
强化学习,分层强化学习  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:21/9  |  提交时间:2024/06/25
Hitch-Hiking Motion of Multiple Bionic Robotic Remoras with Enhanced Multimodal Locomotion 期刊论文
IEEE-ASME Transactions on Mechatronics, 2024, 页码: 1-11
作者:  Wu, Zhengxing;  Yu, Lianyi;  Wang, Jian;  Dai, Shijie;  Tan, Min;  Yu, Junzhi
Adobe PDF(4893Kb)  |  收藏  |  浏览/下载:63/32  |  提交时间:2024/06/24
Learn to flap: foil non-parametric path planning via deep reinforcement learning 期刊论文
Journal of Fluid Mechanics, 2024, 卷号: 984, 页码: A9
作者:  Wang, Zhipeng;  Lin, Runji;  Zhao, Zhiyu;  Chen, Xu;  Guo, Pengming;  Yang, Ning;  Wang,Zhicheng;  Fan, Dixia
Adobe PDF(1892Kb)  |  收藏  |  浏览/下载:49/11  |  提交时间:2024/06/07
A Fish-like Binocular Vision System for Underwater Perception of Robotic Fish 期刊论文
Biomimetics, 2024, 页码: 171
作者:  Tong Ru;  Wu Zhengxing;  Wang Jinge;  Huang Yupei;  Chen Di;  Yu Junzhi
Adobe PDF(4134Kb)  |  收藏  |  浏览/下载:40/15  |  提交时间:2024/06/06
Improve the efficiency of deep reinforcement learning through semantic exploration guided by natural language. 会议论文
, 北京华腾美居酒店, 2023-12-9
作者:  Zhourui Guo;  Meng Yao;  Yang Yu;  Qiyue Yin
Adobe PDF(2302Kb)  |  收藏  |  浏览/下载:37/13  |  提交时间:2024/06/03
稀疏奖励环境下基于自博弈框架的智能空战算法研究 学位论文
, 2024
作者:  何少钦
Adobe PDF(4570Kb)  |  收藏  |  浏览/下载:51/1  |  提交时间:2024/05/30
强化学习,离线强化学习,空战,智能决策,好奇心机制  
Representative Demonstration Selection for In-Context Learning with Two-Stage Determinantal Point Process 会议论文
, Singapore, 2023-12
作者:  Zhao Yang;  Yuanzhe Zhang;  Dianbo Sui;  Cao Liu;  Jun Zhao;  Kang Liu
Adobe PDF(592Kb)  |  收藏  |  浏览/下载:57/24  |  提交时间:2024/05/30
Autonomous Recovery Control of Biomimetic Robotic Fish Based on Multi-Sensory System 期刊论文
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 卷号: 9, 期号: 2, 页码: 1420-1427
作者:  Qiu, Changlin;  Wu, Zhengxing;  Wang, Jian;  Tan, Min;  Yu, Junzhi
Adobe PDF(4262Kb)  |  收藏  |  浏览/下载:72/16  |  提交时间:2024/03/26
Robots  Wires  Tail  Robot sensing systems  Task analysis  Springs  Sensors  Autonomous recovery  motion control  robotic fish  underwater perception  
Large sequence models for sequential decision-making: a survey 期刊论文
FRONTIERS OF COMPUTER SCIENCE, 2023, 卷号: 17, 期号: 6, 页码: 18
作者:  Wen, Muning;  Lin, Runji;  Wang, Hanjing;  Yang, Yaodong;  Wen, Ying;  Mai, Luo;  Wang, Jun;  Zhang, Haifeng;  Zhang, Weinan
Adobe PDF(1351Kb)  |  收藏  |  浏览/下载:152/5  |  提交时间:2023/11/17
sequential decision-making  sequence modeling  the Transformer  training system