CASIA OpenIR

浏览/检索结果: 共14条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 12, 页码: 2233-2247
作者:  Hongyu Ding;  Yuanze Tang;  Qing Wu;  Bo Wang;  Chunlin Chen;  Zhi Wang
Adobe PDF(5205Kb)  |  收藏  |  浏览/下载:98/32  |  提交时间:2023/10/31
Dynamic environments  goal-conditioned reinforcement learning  magnetic field  reward shaping  
Privacy Preserving Demand Side Management Method via Multi-Agent Reinforcement Learning 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 10, 页码: 1984-1999
作者:  Feiye Zhang;  Qingyu Yang;  Dou An
Adobe PDF(3841Kb)  |  收藏  |  浏览/下载:80/41  |  提交时间:2023/09/07
Centralized training and decentralized execution  demand side management  multi-agent reinforcement learning  privacy preserving  
Adaptive Multi-Step Evaluation Design With Stability Guarantee for Discrete-Time Optimal Learning Control 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 9, 页码: 1797-1809
作者:  Ding Wang;  Jiangyu Wang;  Mingming Zhao;  Peng Xin;  Junfei Qiao
Adobe PDF(5140Kb)  |  收藏  |  浏览/下载:141/59  |  提交时间:2023/08/10
Adaptive critic  artificial neural networks  Hamilton-Jacobi-Bellman (HJB) equation  multi-step heuristic dynamic programming  multi-step reinforcement learning  optimal control  
Survey on AI and Machine Learning Techniques for Microgrid Energy Management Systems 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 7, 页码: 1513-1529
作者:  Aditya Joshi;  Skieler Capezza;  Ahmad Alhaji;  Mo-Yuen Chow
Adobe PDF(4919Kb)  |  收藏  |  浏览/下载:167/77  |  提交时间:2023/06/14
Consensus  energy management system (EMS)  reinforcement learning  supervised learning  
A Data-Based Feedback Relearning Algorithm for Uncertain Nonlinear Systems 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 5, 页码: 1288-1303
作者:  Chaoxu Mu;  Yong Zhang;  Guangbin Cai;  Ruijun Liu;  Changyin Sun
Adobe PDF(4205Kb)  |  收藏  |  浏览/下载:218/108  |  提交时间:2023/04/26
Data episodes  experience replay  neural networks  reinforcement learning (RL)  uncertain systems  
A Brief Overview of ChatGPT: The History, Status Quo and Potential Future Development 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 5, 页码: 1122-1136
作者:  Tianyu Wu;  Shizhu He;  Jingping Liu;  Siqi Sun;  Kang Liu;  Qing-Long Han;  Yang Tang
Adobe PDF(4650Kb)  |  收藏  |  浏览/下载:792/655  |  提交时间:2023/04/26
AIGC  ChatGPT  GPT-3  GPT-4  human feedback  large language models  
Machine Learning Accelerated Real-Time Model Predictive Control for Power Systems 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 4, 页码: 916-930
作者:  Ramij Raja Hossain;  Ratnesh Kumar
Adobe PDF(5320Kb)  |  收藏  |  浏览/下载:211/56  |  提交时间:2023/03/22
Machine learning  model predictive control (MPC)  neural network  perturbation control  voltage stabilization  
Online Optimization in Power Systems With High Penetration of Renewable Generation: Advances and Prospects 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 4, 页码: 839-858
作者:  Zhaojian Wang;  Wei Wei;  John Zhen Fu Pang;  Feng Liu;  Bo Yang;  Xinping Guan;  Shengwei Mei
Adobe PDF(2336Kb)  |  收藏  |  浏览/下载:168/31  |  提交时间:2023/03/22
Feedback optimization  Lyapunov optimization  online convex optimization  online optimization  optimization-guided control  
Policy Iteration for Optimal Control of Discrete-Time Time-Varying Nonlinear Systems 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 3, 页码: 781-791
作者:  Guangyu Zhu;  Xiaolu Li;  Ranran Sun;  Yiyuan Yang;  Peng Zhang
Adobe PDF(2432Kb)  |  收藏  |  浏览/下载:190/66  |  提交时间:2023/03/02
Adaptive critic designs  adaptive dynamic programming  approximate dynamic programming  optimal control  policy iteration  time-varying  
Dynamic Frontier-Led Swarming: Multi-Robot Repeated Coverage in Dynamic Environments 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 3, 页码: 646-661
作者:  Vu Phi Tran;  Matthew A. Garratt;  Kathryn Kasmarik;  Sreenatha G. Anavatti
Adobe PDF(7373Kb)  |  收藏  |  浏览/下载:185/39  |  提交时间:2023/03/02
Artificial pheromones  distributed control architecture  dynamic obstacle avoidance  multi-robot coverage  stigmergy  swarm robotics