CASIA OpenIR

浏览/检索结果: 共23条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
A memory and attention-based reinforcement learning for musculoskeletal robots with prior knowledge of muscle synergies 期刊论文
Robotic Intelligence and Automation, 2024, 卷号: 44, 期号: 2, 页码: 316-333
作者:  Xiaona Wang;  Jiahao Chen;  Hong Qiao
Adobe PDF(2591Kb)  |  收藏  |  浏览/下载:60/18  |  提交时间:2024/06/04
Musculoskeletal robot  Partial observable  Reinforcement learning  LSTM  Attention  Muscle synergy  
Multistep Look-Ahead Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems With Isoperimetric Constraints 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 卷号: 54, 期号: 3, 页码: 1414-1426
作者:  Li, Tao;  Wei, Qinglai;  Wang, Fei-Yue
Adobe PDF(784Kb)  |  收藏  |  浏览/下载:121/22  |  提交时间:2024/02/22
Performance analysis  Optimal control  Dynamic programming  Iterative algorithms  Upper bound  Measurement  Convergence  Adaptive dynamic programming (ADP)  isoperimetric constraints  nonlinear systems  optimal control  policy iteration  
Equilibrium Strategy of the Pursuit-Evasion Game in Three-Dimensional Space 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 2, 页码: 446-458
作者:  Nuo Chen;  Linjing Li;  Wenji Mao
Adobe PDF(3567Kb)  |  收藏  |  浏览/下载:154/38  |  提交时间:2024/01/23
Differential game  equilibrium strategy  pursuit-evasion game  threedegree-of-freedom control  
Large sequence models for sequential decision-making: a survey 期刊论文
FRONTIERS OF COMPUTER SCIENCE, 2023, 卷号: 17, 期号: 6, 页码: 18
作者:  Wen, Muning;  Lin, Runji;  Wang, Hanjing;  Yang, Yaodong;  Wen, Ying;  Mai, Luo;  Wang, Jun;  Zhang, Haifeng;  Zhang, Weinan
Adobe PDF(1351Kb)  |  收藏  |  浏览/下载:154/5  |  提交时间:2023/11/17
sequential decision-making  sequence modeling  the Transformer  training system  
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:  Chai, Jiajun;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2469Kb)  |  收藏  |  浏览/下载:63/3  |  提交时间:2023/11/16
Large-scale multiagent  neighboring communication  reinforcement learning (RL)  variational information flow  
Data Generation Feedback Relearning Control for Unmodeled Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2023, 页码: 12
作者:  Zhang, Yong;  Mu, Chaoxu;  Zhao, Dongbin
收藏  |  浏览/下载:112/0  |  提交时间:2023/11/16
Data models  Real-time systems  Heuristic algorithms  Mathematical models  Adaptation models  Approximation algorithms  Cost function  Data generation model  feedback relearning control  delayed neural network  reinforcement learning  unmodeled nonlinear system  
Deep Reinforcement Learning With Part-Aware Exploration Bonus in Video Games 期刊论文
IEEE TRANSACTIONS ON GAMES, 2022, 卷号: 14, 期号: 4, 页码: 644-653
作者:  Xu, Pei;  Yin, Qiyue;  Zhang, Junge;  Huang, Kaiqi
Adobe PDF(1480Kb)  |  收藏  |  浏览/下载:349/90  |  提交时间:2023/02/22
Deep learning  exploration  reinforcement learning  video game  
Solving the spike feature information vanishing problem in spiking deep Q network with potential based normalization 期刊论文
FRONTIERS IN NEUROSCIENCE, 2022, 卷号: 16, 页码: 11
作者:  Sun, Yinqian;  Zeng, Yi;  Li, Yang
Adobe PDF(1561Kb)  |  收藏  |  浏览/下载:273/41  |  提交时间:2022/11/14
brain-inspired decision model  SDQN  reinforcement learning  potential normalization  spiking activity  
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 13
作者:  Hu, Guangzheng;  Zhu, Yuanheng;  Zhao, Dongbin;  Zhao, Mengchen;  Hao, Jianye
Adobe PDF(4187Kb)  |  收藏  |  浏览/下载:266/12  |  提交时间:2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)  
Decentralized Event-Driven Constrained Control Using Adaptive Critic Designs 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 15
作者:  Yang, Xiong;  Zhu, Yuanheng;  Dong, Na;  Wei, Qinglai
Adobe PDF(1578Kb)  |  收藏  |  浏览/下载:240/16  |  提交时间:2022/01/27
Adaptive critic designs (ACDs)  adaptive dynamic programming (ADP)  decentralized event-driven control  input constraint  reinforcement learning (RL)