CASIA OpenIR

浏览/检索结果: 共12条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:31/13  |  提交时间:2024/06/25
强化学习,分层强化学习  
Interpreting Sentiment Composition with Latent Semantic Tree 会议论文
, Toronto, Canada, 2023-7-9
作者:  Zhongtao Jiang;  Yuanzhe Zhang;  Cao Liu;  Jiansong Chen;  Jun Zhao;  Kang Liu
Adobe PDF(509Kb)  |  收藏  |  浏览/下载:40/18  |  提交时间:2024/06/06
A Wire-driven Elastic Robotic Fish and its Design and CPG-Based Control 期刊论文
Journal of Intelligent & Robotic Systems, 2023, 卷号: 107, 期号: 1, 页码: 4
作者:  Xiaocun Liao;  Chao Zhou;  Jian Wang;  Junfeng Fan;  Zhuoliang Zhang
Adobe PDF(1749Kb)  |  收藏  |  浏览/下载:51/18  |  提交时间:2024/05/28
Robotic fish  Wire-driven mode  Elastic component  Kinematics model  Body wave  
Large sequence models for sequential decision-making: a survey 期刊论文
FRONTIERS OF COMPUTER SCIENCE, 2023, 卷号: 17, 期号: 6, 页码: 18
作者:  Wen, Muning;  Lin, Runji;  Wang, Hanjing;  Yang, Yaodong;  Wen, Ying;  Mai, Luo;  Wang, Jun;  Zhang, Haifeng;  Zhang, Weinan
Adobe PDF(1351Kb)  |  收藏  |  浏览/下载:149/4  |  提交时间:2023/11/17
sequential decision-making  sequence modeling  the Transformer  training system  
Locating Dipole Source Using Self-Propelled Robotic Fish With Artificial Lateral Line System 期刊论文
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, 页码: 11
作者:  Qiu, Changlin;  Wu, Zhengxing;  Wang, Jian;  Tan, Min;  Yu, Junzhi
Adobe PDF(1507Kb)  |  收藏  |  浏览/下载:117/13  |  提交时间:2023/11/16
Underwater robot  robotic fish  artificial lateral line  noise estimation  dipole source localization  
A Torque Control Strategy for a Robotic Dolphin Platform Based on Angle of Attack Feedback 期刊论文
Biomimetics, 2023, 卷号: 8, 页码: 291
作者:  Tianzhu Wang;  Junzhi Yu;  Di Chen;  Yan Meng
Adobe PDF(1925Kb)  |  收藏  |  浏览/下载:174/64  |  提交时间:2023/09/21
robotic dolphin  torque control  angle of attack  motion improvement  
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文
Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14
作者:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:203/47  |  提交时间:2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Pseudo Value Network Distillation for High-Performance Exploration 会议论文
, 澳大利亚, 2023-06
作者:  Zhao EM(赵恩民);  Xing JL(兴军亮);  Li K(李凯);  Kang YX(康永欣);  Tao P(陶品)
Adobe PDF(5874Kb)  |  收藏  |  浏览/下载:158/45  |  提交时间:2023/06/28
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Feilong Chen;  Duzhen Zhang;  Minglun Han;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(969Kb)  |  收藏  |  浏览/下载:174/34  |  提交时间:2023/06/21
Efficient Accelerator/Network Co-Search with Circular Greedy Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, 2023, 页码: 1-5
作者:  Liu, Zejian;  Li, Gang;  Cheng, Jian
Adobe PDF(1982Kb)  |  收藏  |  浏览/下载:135/43  |  提交时间:2023/06/19
Accelerator/Network Co-Search  Reinforcement Learning  Performance Estimation  Multi-objective Optimization