CASIA OpenIR

浏览/检索结果: 共278条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
CASOG: Conservative Actor–Critic With SmOoth Gradient for Skill Learning in Robot-Assisted Intervention 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, 页码: 10
作者:  Li, Hao;  Zhou, Xiao-Hu;  Xie, Xiao-Liang;  Liu, Shi-Qi;  Feng, Zhen-Qiu;  Hou, Zeng-Guang
收藏  |  浏览/下载:40/0  |  提交时间:2024/02/22
Deep neural network  offline reinforcement learning  robot-assisted intervention  vascular robotic system  
Peer Incentive Reinforcement Learning for Cooperative Multiagent Games 期刊论文
IEEE TRANSACTIONS ON GAMES, 2023, 卷号: 15, 期号: 4, 页码: 623-636
作者:  Zhang, Tianle;  Liu, Zhen;  Pu, Zhiqiang;  Yi, Jianqiang
收藏  |  浏览/下载:19/0  |  提交时间:2024/02/22
Cooperative multiagent games  intrinsic reward  multiagent reinforcement learning (MARL)  Starcraft II Micromanagement  
Approximate Dynamic Programming for Event-Driven H-8 Constrained Control 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 页码: 11
作者:  Yang, Xiong;  Xu, Mengmeng;  Wei, Qinglai
收藏  |  浏览/下载:52/0  |  提交时间:2023/11/17
Approximate dynamic programming (ADP)  event-driven control  H(8)control  input constraint  optimal control  
Sample-Observed Soft Actor-Critic Learning for Path Following of a Biomimetic Underwater Vehicle 期刊论文
IEEE Transactions on Automation Science and Engineering, 2023, 页码: 1-10
作者:  Ma, Ruichen;  Wang, Yu;  Wang, Shuo;  Cheng, Long;  Wang, Rui;  Tan, Ming
Adobe PDF(2902Kb)  |  收藏  |  浏览/下载:165/53  |  提交时间:2023/08/03
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文
Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14
作者:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:161/37  |  提交时间:2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Deep Contrastive Multiview Network Embedding 会议论文
Proceedings of the 31st ACM International Conference on Information and Knowledge Management, New York, NY, USA, 2022-10-17
作者:  Mengqi Zhang;  Yanqiao Zhu;  Qiang Liu;  Shu Wu;  Liang Wang
Adobe PDF(1307Kb)  |  收藏  |  浏览/下载:109/33  |  提交时间:2023/07/03
Potential Driven Reinforcement Learning for Hard Exploration Tasks 会议论文
, 线上, 2020-4
作者:  Zhao EM(赵恩民);  Deng SH(邓诗弘);  Zang YF(臧一凡);  Kang YX(康永欣);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF(1999Kb)  |  收藏  |  浏览/下载:69/24  |  提交时间:2023/06/29
AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning 会议论文
, 线上, 2022-02-22
作者:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li JQ(李金秋);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF(2593Kb)  |  收藏  |  浏览/下载:104/41  |  提交时间:2023/06/29
Learning to Play Hard Exploration Games Using Graph-guided Self-navigation 会议论文
, 线上, 2021-02
作者:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li K(李凯);  Li LJ(李丽娟);  Xing JL(兴军亮)
Adobe PDF(413Kb)  |  收藏  |  浏览/下载:128/50  |  提交时间:2023/06/28
Stable Training of Bellman Error in Reinforcement Learning 会议论文
, Thailand, November 18–22
作者:  Gong C(龚晨);  Bai YP(白云鹏);  Hou XW(侯新文);  Ji XH(季晓慧)
Adobe PDF(2416Kb)  |  收藏  |  浏览/下载:91/28  |  提交时间:2023/06/27