CASIA OpenIR

Browse/search results: 218 items in total, showing items 1-10

Target-Following Control of a Biomimetic Autonomous System Based on Predictive Reinforcement Learning (Journal article)
BIOMIMETICS, 2024, Volume: 9, Issue: 1, Pages: 19
Authors: Wang, Yu; Wang, Jian; Kang, Song; Yu, Junzhi
Views/Downloads: 8/0 | Submitted: 2024/03/26
biomimetic motion  biomimetic autonomous system  target following  deep reinforcement learning  predictive control  
Adaptive bias-variance trade-off in advantage estimator for actor-critic algorithms (Journal article)
NEURAL NETWORKS, 2024, Volume: 169, Pages: 764-777
Authors: Chen, Yurou; Zhang, Fengyi; Liu, Zhiyong
Views/Downloads: 16/0 | Submitted: 2024/02/22
Reinforcement Learning  Policy gradient  Actor-critic  Value function  Bias-variance trade-off  
Peer Incentive Reinforcement Learning for Cooperative Multiagent Games (Journal article)
IEEE TRANSACTIONS ON GAMES, 2023, Volume: 15, Issue: 4, Pages: 623-636
Authors: Zhang, Tianle; Liu, Zhen; Pu, Zhiqiang; Yi, Jianqiang
Views/Downloads: 18/0 | Submitted: 2024/02/22
Cooperative multiagent games  intrinsic reward  multiagent reinforcement learning (MARL)  Starcraft II Micromanagement  
Sample-Observed Soft Actor-Critic Learning for Path Following of a Biomimetic Underwater Vehicle (Journal article)
IEEE Transactions on Automation Science and Engineering, 2023, Pages: 1-10
Authors: Ma, Ruichen; Wang, Yu; Wang, Shuo; Cheng, Long; Wang, Rui; Tan, Ming
Adobe PDF(2902Kb) | Views/Downloads: 163/52 | Submitted: 2023/08/03
Efficient Accelerator/Network Co-Search with Circular Greedy Reinforcement Learning (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, 2023, Pages: 1-5
Authors: Liu, Zejian; Li, Gang; Cheng, Jian
Adobe PDF(1982Kb) | Views/Downloads: 104/32 | Submitted: 2023/06/19
Accelerator/Network Co-Search  Reinforcement Learning  Performance Estimation  Multi-objective Optimization  
A neural network based framework for variable impedance skills learning from demonstrations (Journal article)
ROBOTICS AND AUTONOMOUS SYSTEMS, 2023, Volume: 160, Pages: 10
Authors: Zhang, Yu; Cheng, Long; Cao, Ran; Li, Houcheng; Yang, Chenguang
Adobe PDF(3824Kb) | Views/Downloads: 279/55 | Submitted: 2023/02/22
Variable impedance skill  Learning from demonstrations  Skills learning  Human-robot interaction  
Hierarchical graph attention network for temporal knowledge graph reasoning (Journal article)
Neurocomputing, 2023, Pages: 126390
Authors: Shao PP (邵朋朋)
Adobe PDF(1512Kb) | Views/Downloads: 85/25 | Submitted: 2023/07/03
Adaptive pseudo-Siamese policy network for temporal knowledge prediction (Journal article)
Neural Networks, 2023, Volume: 160, Pages: 192-201
Authors: Shao PP (邵朋朋)
Adobe PDF(1256Kb) | Views/Downloads: 81/29 | Submitted: 2023/07/03
Enhanced Rolling Horizon Evolution Algorithm With Opponent Model Learning: Results for the Fighting Game AI Competition (Journal article)
IEEE TRANSACTIONS ON GAMES, 2023, Volume: 15, Issue: 1, Pages: 5-15
Authors: Zhentao Tang; Yuanheng Zhu; Dongbin Zhao; Simon M. Lucas
Adobe PDF(7686Kb) | Views/Downloads: 219/60 | Submitted: 2021/07/05
Rolling horizon evolution  opponent model  reinforcement learning  supervised learning  fighting game  
Residual Reinforcement Learning for Motion Control of a Bionic Exploration Robot - RoboDact (Journal article)
IEEE Transactions on Instrumentation and Measurement, 2023, Pages: 1-13
Authors: Zhang Tiandong; Wang Rui; Wang Shuo; Wang Yu; Zheng Gang; Tan Min
Adobe PDF(3127Kb) | Views/Downloads: 111/43 | Submitted: 2023/06/14