CASIA OpenIR

浏览/检索结果: 共153条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:24/12  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:18/9  |  提交时间:2024/06/25
Enhancing Reinforcement Learning via Transformer-based State Predictive Representations 期刊论文
IEEE Transactions on Artificial Intelligence, 2024, 页码: 1 - 12
作者:  Liu MS(刘民颂);  Zhu YH(朱圆恒);  Chen YR(陈亚冉);  Zhao DB(赵冬斌)
Adobe PDF(1162Kb)  |  收藏  |  浏览/下载:28/8  |  提交时间:2024/06/24
Power Control Based on Deep Reinforcement Learning for Spectrum Sharing 期刊论文
IEEE Transactions on Wireless Communications, 2024, 卷号: 19, 期号: 6, 页码: 4209-4219
作者:  Zhang,Haijun;  Yang,Ning;  Huangfu,Wei;  Long,Keping;  Leung,VictorCM
Adobe PDF(1925Kb)  |  收藏  |  浏览/下载:38/17  |  提交时间:2024/06/12
Learning Robust Communication by Adversarial Training in Networked System Control 期刊论文
Lecture Notes in Electrical Engineering, 2024, 页码: Chapter 52 978-981-97-3335-4
作者:  Runji, Lin;  Haifeng, Zhang
Adobe PDF(8334Kb)  |  收藏  |  浏览/下载:37/14  |  提交时间:2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning  
Learn to flap: foil non-parametric path planning via deep reinforcement learning 期刊论文
Journal of Fluid Mechanics, 2024, 卷号: 984, 页码: A9
作者:  Wang, Zhipeng;  Lin, Runji;  Zhao, Zhiyu;  Chen, Xu;  Guo, Pengming;  Yang, Ning;  Wang,Zhicheng;  Fan, Dixia
Adobe PDF(1892Kb)  |  收藏  |  浏览/下载:46/11  |  提交时间:2024/06/07
Secure Tracking Control via Fixed-Time Convergent Reinforcement Learning for a UAV CPS 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1699-1701
作者:  Zhenyu Gong;  Feisheng Yang
Adobe PDF(604Kb)  |  收藏  |  浏览/下载:43/18  |  提交时间:2024/06/07
FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game 期刊论文
IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, 页码: 1-13
作者:  Guangzheng Hu;  Yuanheng Zhu;  Haoran Li;  Dongbin Zhao
Adobe PDF(2144Kb)  |  收藏  |  浏览/下载:37/7  |  提交时间:2024/06/05
Games  Q-learning  Task analysis  Optimization  Convergence  Training  Nash equilibrium  Multi-agent reinforcement learning  minimax-Q learning  two-team zero-sum Markov games  
A memory and attention-based reinforcement learning for musculoskeletal robots with prior knowledge of muscle synergies 期刊论文
Robotic Intelligence and Automation, 2024, 卷号: 44, 期号: 2, 页码: 316-333
作者:  Xiaona Wang;  Jiahao Chen;  Hong Qiao
Adobe PDF(2591Kb)  |  收藏  |  浏览/下载:52/15  |  提交时间:2024/06/04
Musculoskeletal robot  Partial observable  Reinforcement learning  LSTM  Attention  Muscle synergy  
Motion Learning for Musculoskeletal Robots Based on Cortex-Inspired Motor Primitives and Modulation 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2024, 卷号: 16, 期号: 2, 页码: 744-756
作者:  Xiaona Wang;  Jiahao Chen;  Wei Wu
Adobe PDF(3444Kb)  |  收藏  |  浏览/下载:32/8  |  提交时间:2024/06/04
Biologically inspired control  motor preparation  motor primitive  musculoskeletal robot  recurrent neural network (RNN)