CASIA OpenIR

浏览/检索结果: 共14条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Hierarchical Cooperative Swarm Policy Learning with Role Emergence 会议论文
, Online, 05-07 December 2021
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Qiu TH(丘腾海);  Yi JQ(易建强)
Adobe PDF(327Kb)  |  收藏  |  浏览/下载:127/57  |  提交时间:2023/06/12
UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 12
作者:  Chai, Jiajun;  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin;  Ma, Zhe;  Sun, Kewu;  Ding, Jishiyu
Adobe PDF(3402Kb)  |  收藏  |  浏览/下载:247/26  |  提交时间:2022/01/27
Multi-agent systems  Training  Task analysis  Reinforcement learning  Sun  Learning systems  Semantics  Centralized training with decentralized execution (CTDE)  multiagent  reinforcement learning  StarCraft II  
Spiking Adaptive Dynamic Programming Based on Poisson Process for Discrete-Time Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 11
作者:  Wei, Qinglai;  Han, Liyuan;  Zhang, Tielin
Adobe PDF(2904Kb)  |  收藏  |  浏览/下载:192/1  |  提交时间:2022/01/27
Maximum likelihood estimation (MLE)  Nonlinear systems  Optimal control  Poisson process  Spike train  Spiking Adaptive dynamic programming(SADP)  
Hierarchical Motion Learning for Goal-Oriented Movements With Speed-Accuracy Tradeoff of a Musculoskeletal System 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2021, 页码: 14
作者:  Zhou, Junjie;  Zhong, Shanlin;  Wu, Wei
Adobe PDF(5440Kb)  |  收藏  |  浏览/下载:264/46  |  提交时间:2022/01/27
Brain-inspired decision making  Fitts' law  Motion generation  Musculoskeletal system  Speed-accuracy tradeoff (SAT)  
Learning to Assemble Noncylindrical Parts Using Trajectory Learning and Force Tracking 期刊论文
IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2021, 页码: 12
作者:  Su, Jianhua;  Meng, Yan;  Wang, Lili;  Yang, Xu
Adobe PDF(4865Kb)  |  收藏  |  浏览/下载:327/54  |  提交时间:2022/01/27
Force  Trajectory  Robots  Task analysis  Hidden Markov models  Impedance  Training  Assembly skill  impedance control  learning from demonstration  movement primitives (MPs)  
Generalized Actor-Critic Learning Optimal Control in Smart Home Energy Management 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 卷号: 17, 期号: 10, 页码: 6614-6623
作者:  Wei, Qinglai;  Liao, Zehua;  Shi, Guang
Adobe PDF(1229Kb)  |  收藏  |  浏览/下载:261/33  |  提交时间:2021/11/02
Optimal control  Process control  Smart homes  Dynamic programming  Numerical models  Iterative methods  Informatics  Actor-critic learning  adaptive critic designs  adaptive dynamic programming (ADP)  approximate dynamic programming  energy management  optimal control  smart grid  
Learning Control for Air Conditioning Systems via Human Expressions 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2021, 卷号: 68, 期号: 8, 页码: 7662-7671
作者:  Wei, Qinglai;  Li, Tao;  Liu, Derong
Adobe PDF(1354Kb)  |  收藏  |  浏览/下载:221/1  |  提交时间:2021/06/15
Adaptive dynamic programming  air conditioning control  deep learning (DL)  deep Q-network (DQN)  human expressions  optimal control  reinforcement learning (RL)  Q-learning  
Output-Feedback Based Simplified Optimized Backstepping Control for Strict-Feedback Systems with Input and State Constraints 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2021, 卷号: 8, 期号: 6, 页码: 1119-1132
作者:  Jiaxin Zhang;  Kewen Li;  Yongming Li
Adobe PDF(1727Kb)  |  收藏  |  浏览/下载:121/46  |  提交时间:2021/06/11
Backstepping design  immeasurable states  neural-networks (NNs)  optimal control  state constraints  
Continuous-Time Distributed Policy Iteration for Multicontroller Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2021, 卷号: 51, 期号: 5, 页码: 2372-2383
作者:  Wei, Qinglai;  Li, Hongyang;  Yang, Xiong;  He, Haibo
Adobe PDF(1246Kb)  |  收藏  |  浏览/下载:232/40  |  提交时间:2021/06/07
Optimal control  Nonlinear systems  Decentralized control  Mathematical model  Convergence  Multi-agent systems  Adaptive dynamic programming (ADP)  approximate dynamic programming  distributed policy iteration  nonlinear systems  optimal control  
Neural-Network-Based Control for Discrete-Time Nonlinear Systems with Input Saturation Under Stochastic Communication Protocol 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2021, 卷号: 8, 期号: 4, 页码: 766-778
作者:  Xueli Wang;  Derui Ding;  Hongli Dong;  Xian-Ming Zhang
Adobe PDF(1995Kb)  |  收藏  |  浏览/下载:203/43  |  提交时间:2021/04/09
Adaptive dynamic programming (ADP)  constrained inputs  neural network (NN)  stochastic communication protocols (SCPs)  suboptimal control