CASIA OpenIR

浏览/检索结果: 共130条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Exploration via Joint Policy Diversity for Sparse-Reward Multi-Agent Tasks 会议论文
, Macao, China, 2023-8
作者:  Pei Xu;  Junge Zhang;  Kaiqi Huang
Adobe PDF(1369Kb)  |  收藏  |  浏览/下载:220/68  |  提交时间:2023/06/19
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文
Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14
作者:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:161/37  |  提交时间:2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition 会议论文
, Washington D.C., USA, 2023-2-9
作者:  Qingyu Wang;  Tielin Zhang;  Minglun Han;  Yi Wang;  Duzhen Zhang;  Bo Xu
Adobe PDF(1714Kb)  |  收藏  |  浏览/下载:138/44  |  提交时间:2023/06/20
基于噪声对比估计的权重自适应对抗生成式模仿学习 期刊论文
模式识别与人工智能, 2023, 卷号: 36, 期号: 4, 页码: 300-312
作者:  关伟凡;  张希
Adobe PDF(1849Kb)  |  收藏  |  浏览/下载:118/39  |  提交时间:2023/06/29
强化学习  模仿学习  噪声对比估计  自适应权重  
Pseudo Value Network Distillation for High-Performance Exploration 会议论文
, 澳大利亚, 2023-06
作者:  Zhao EM(赵恩民);  Xing JL(兴军亮);  Li K(李凯);  Kang YX(康永欣);  Tao P(陶品)
Adobe PDF(5874Kb)  |  收藏  |  浏览/下载:129/38  |  提交时间:2023/06/28
Subspace-Aware Exploration for Sparse-Reward Multi-Agent Tasks 会议论文
, Washington DC, USA, 2023-2-7
作者:  Pei Xu;  Junge Zhang;  Qiyue Yin;  Chao Yu;  Yaodong Yang;  Kaiqi Huang
Adobe PDF(2037Kb)  |  收藏  |  浏览/下载:193/60  |  提交时间:2023/06/19
deep reinforcement learning  sparse reward  exploration  multi-agent  
MOSO: Decomposing MOtion, Scene and Object for Video Prediction 会议论文
, Vancouver, Canada, 2023-6-18
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(1504Kb)  |  收藏  |  浏览/下载:93/21  |  提交时间:2023/05/04
Learning Cooperative Policies with Graph Networks in Distributed Swarm Systems 会议论文
, Queensland, Australia, June 18-23, 2023
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强);  Ai XL(艾晓琳);  Yuan GM(袁莞迈)
Adobe PDF(612Kb)  |  收藏  |  浏览/下载:136/45  |  提交时间:2023/06/12
Improving the Ability of Robots to Navigate Through Crowded Environments Safely using Deep Reinforcement Learning 会议论文
, 中国桂林, 2022-7-9
作者:  Shan QF(单钦锋);  Wang WJ(王伟杰);  Guo DF(郭丁飞);  Sun XR(孙向荣);  Jia LH(贾立好)
Adobe PDF(494Kb)  |  收藏  |  浏览/下载:100/29  |  提交时间:2023/06/05
Deep learning  Mechatronics  Navigation  Reinforcement learning  Cost function  Real-time systems  Trajectory  
Contact Force Prediction for a Robotic Transesophageal Ultrasound Probe via Operating Torque Sensing 会议论文
, Singapore, 2022-9-18
作者:  Xie Yiping(谢亿平);  Hou Xilong(侯西龙);  Liu Hongbin(刘宏斌);  Housden James;  Rhode Kawal;  Hou Zeng-Guang(侯增广);  Wang Shuangyi(王双翌)
Adobe PDF(1946Kb)  |  收藏  |  浏览/下载:271/149  |  提交时间:2023/05/31
Ultrasound robot  Continuum robot  Contact force estimation