CASIA OpenIR

浏览/检索结果: 共217条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Learning to Play Football from Sports Perspective: A Knowledge-embedded Deep Reinforcement Learning Framework 期刊论文
IEEE Transactions on Games, 2022, 页码: 12
作者:  Liu BY(刘博寅)
Adobe PDF(2957Kb)  |  收藏  |  浏览/下载:24/5  |  提交时间:2024/07/12
QFuture: Learning Future Expectation Cognition in Multi-Agent Reinforcement Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 12
作者:  Liu BY(刘博寅)
Adobe PDF(6675Kb)  |  收藏  |  浏览/下载:16/2  |  提交时间:2024/07/12
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:33/15  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
APTMRS: Autonomous Prism Target Maintenance Robotic System for FAST 期刊论文
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 页码: 17
作者:  Tao, Rui;  Jing, Fengshui;  Hou, Jun;  Xing, Shiyu;  Fu, Yichen;  Fan, Junfeng;  Tan, Min
收藏  |  浏览/下载:8/0  |  提交时间:2024/07/04
Bolt assembly  end-effector  pose measurement  manipulation policies  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:19/9  |  提交时间:2024/06/25
Learning Robust Communication by Adversarial Training in Networked System Control 期刊论文
Lecture Notes in Electrical Engineering, 2024, 页码: Chapter 52 978-981-97-3335-4
作者:  Runji, Lin;  Haifeng, Zhang
Adobe PDF(8334Kb)  |  收藏  |  浏览/下载:43/16  |  提交时间:2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning  
Learn to flap: foil non-parametric path planning via deep reinforcement learning 期刊论文
Journal of Fluid Mechanics, 2024, 卷号: 984, 页码: A9
作者:  Wang, Zhipeng;  Lin, Runji;  Zhao, Zhiyu;  Chen, Xu;  Guo, Pengming;  Yang, Ning;  Wang,Zhicheng;  Fan, Dixia
Adobe PDF(1892Kb)  |  收藏  |  浏览/下载:48/11  |  提交时间:2024/06/07
A Fish-like Binocular Vision System for Underwater Perception of Robotic Fish 期刊论文
Biomimetics, 2024, 页码: 171
作者:  Tong Ru;  Wu Zhengxing;  Wang Jinge;  Huang Yupei;  Chen Di;  Yu Junzhi
Adobe PDF(4134Kb)  |  收藏  |  浏览/下载:40/15  |  提交时间:2024/06/06
Explanation Guided Knowledge Distillation for Pre-trained Language Model Compression 期刊论文
ACM Transactions on Asian and Low-Resource Language Information Processing, 2024, 卷号: 23, 期号: 2, 页码: 1-19
作者:  Zhao Yang;  Yuanzhe Zhang;  Dianbo Sui;  Yiming Ju;  Jun Zhao;  Kang Liu
Adobe PDF(1250Kb)  |  收藏  |  浏览/下载:54/20  |  提交时间:2024/05/30
Explanation  knowledge distillation  model compression  
Information bottleneck based knowledge selection for commonsense reasoning 期刊论文
Information Sciences, 2024, 卷号: 660, 页码: 120134
作者:  Zhao Yang;  Yuanzhe Zhang;  Pengfei Cao;  Cao Liu;  Jiansong Chen;  Jun Zhao;  Kang Liu
Adobe PDF(1069Kb)  |  收藏  |  浏览/下载:50/16  |  提交时间:2024/05/30
Commonsense reasoning  Knowledge selection  Information bottleneck  KG-augmented model