CASIA OpenIR

浏览/检索结果: 共6条,第1-6条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Recent Progress in Reinforcement Learning and Adaptive Dynamic Programming for Advanced Control Applications 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 1, 页码: 18-36
作者:  Ding Wang;  Ning Gao;  Derong Liu;  Jinna Li;  Frank L. Lewis
Adobe PDF(1945Kb)  |  收藏  |  浏览/下载:317/203  |  提交时间:2024/01/02
Adaptive dynamic programming (ADP)  advanced control  complex environment  data-driven control  event-triggered design  intelligent control  neural networks  nonlinear systems  optimal control  reinforcement learning (RL)  
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 12, 页码: 2233-2247
作者:  Hongyu Ding;  Yuanze Tang;  Qing Wu;  Bo Wang;  Chunlin Chen;  Zhi Wang
Adobe PDF(5205Kb)  |  收藏  |  浏览/下载:131/42  |  提交时间:2023/10/31
Dynamic environments  goal-conditioned reinforcement learning  magnetic field  reward shaping  
Cooperative and Competitive Multi-Agent Systems: From Optimization to Games 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2022, 卷号: 9, 期号: 5, 页码: 763-783
作者:  Jianrui Wang;  Yitian Hong;  Jiali Wang;  Jiapeng Xu;  Yang Tang;  Qing-Long Han;  Jürgen Kurths
Adobe PDF(8407Kb)  |  收藏  |  浏览/下载:246/68  |  提交时间:2022/04/24
Cooperative games  counterfactual regret min- imization  distributed optimization  federated optimization  fictitious self-play  mean field games  multi-agent reinforcement learning  non-cooperative games  
Hierarchical Reinforcement Learning With Automatic Sub-Goal Identification 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2021, 卷号: 8, 期号: 10, 页码: 1686-1696
作者:  Chenghao Liu;  Fei Zhu;  Quan Liu;  Yuchen Fu
Adobe PDF(5095Kb)  |  收藏  |  浏览/下载:142/51  |  提交时间:2021/09/03
Hierarchical control  hierarchical reinforcement learning  option  sparse reward  sub-goal  
Path Planning for Intelligent Robots Based on Deep Q-learning With Experience Replay and Heuristic Knowledge 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2020, 卷号: 7, 期号: 4, 页码: 1179-1189
作者:  Lan Jiang;  Hongyun Huang;  Zuohua Ding
浏览  |  Adobe PDF(1955Kb)  |  收藏  |  浏览/下载:134/52  |  提交时间:2021/03/11
Deep Q-learning (DQL)  experience replay (ER)  heuristic knowledge (HK)  path planning  
Approximate Dynamic Programming for Stochastic Resource Allocation Problems 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2020, 卷号: 7, 期号: 4, 页码: 975-990
作者:  Ali Forootani;  Raffaele Iervolino;  Massimo Tipaldi;  Joshua Neilson
浏览  |  Adobe PDF(3558Kb)  |  收藏  |  浏览/下载:151/47  |  提交时间:2021/03/11
Approximate dynamic programming (ADP)  dynamic programming (DP)  Markov decision processes (MDPs)  resource allocation problem