CASIA OpenIR

Browse/Search results: 29 items in total, showing items 1-10

Discovering Latent Variables for the Tasks With Confounders in Multi-Agent Reinforcement Learning [Journal article]
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 7, Pages: 1591-1604
Authors:  Kun Jiang;  Wenzhang Liu;  Yuanda Wang;  Lu Dong;  Changyin Sun
Adobe PDF(2128Kb)  |  Views/Downloads: 31/10  |  Submitted: 2024/06/07
Latent variable model  maximum entropy  multi-agent reinforcement learning (MARL)  multi-agent system  
Computational Experiments for Complex Social Systems: Experiment Design and Generative Explanation [Journal article]
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 4, Pages: 1022-1038
Authors:  Xiao Xue;  Deyu Zhou;  Xiangning Yu;  Gang Wang;  Juanjuan Li;  Xia Xie;  Lizhen Cui;  Fei-Yue Wang
Adobe PDF(7239Kb)  |  Views/Downloads: 64/15  |  Submitted: 2024/03/18
Agent-based modeling  computational experiments  cyber-physical-social systems (CPSS)  generative deduction  generative experiments  meta model  
Equilibrium Strategy of the Pursuit-Evasion Game in Three-Dimensional Space [Journal article]
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 2, Pages: 446-458
Authors:  Nuo Chen;  Linjing Li;  Wenji Mao
Adobe PDF(3567Kb)  |  Views/Downloads: 139/33  |  Submitted: 2024/01/23
Differential game  equilibrium strategy  pursuit-evasion game  three-degree-of-freedom control  
Advancements in Humanoid Robots: A Comprehensive Review and Future Prospects [Journal article]
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 2, Pages: 301-328
Authors:  Yuchuang Tong;  Haotian Liu;  Zhengtao Zhang
Adobe PDF(7587Kb)  |  Views/Downloads: 138/33  |  Submitted: 2024/01/23
Future trends and challenges  humanoid robots  human-robot interaction  key technologies  potential applications  
Reinforcement Learning in Process Industries: Review and Perspective [Journal article]
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 2, Pages: 283-300
Authors:  Oguzhan Dogru;  Junyao Xie;  Om Prakash;  Ranjith Chiplunkar;  Jansen Soesanto;  Hongtian Chen;  Kirubakaran Velswamy;  Fadi Ibrahim;  Biao Huang
Adobe PDF(1275Kb)  |  Views/Downloads: 62/21  |  Submitted: 2024/01/23
Process control  process systems engineering  reinforcement learning  
Path Planning and Tracking Control for Parking via Soft Actor-Critic Under Non-Ideal Scenarios [Journal article]
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 1, Pages: 181-195
Authors:  Xiaolin Tang;  Yuyou Yang;  Teng Liu;  Xianke Lin;  Kai Yang;  Shen Li
Adobe PDF(4905Kb)  |  Views/Downloads: 231/132  |  Submitted: 2024/01/02
Automatic parking  control strategy  parking deviation (APS)  soft actor-critic (SAC)  
Recent Progress in Reinforcement Learning and Adaptive Dynamic Programming for Advanced Control Applications [Journal article]
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 1, Pages: 18-36
Authors:  Ding Wang;  Ning Gao;  Derong Liu;  Jinna Li;  Frank L. Lewis
Adobe PDF(1945Kb)  |  Views/Downloads: 293/196  |  Submitted: 2024/01/02
Adaptive dynamic programming (ADP)  advanced control  complex environment  data-driven control  event-triggered design  intelligent control  neural networks  nonlinear systems  optimal control  reinforcement learning (RL)  
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning [Journal article]
IEEE/CAA Journal of Automatica Sinica, 2023, Volume: 10, Issue: 12, Pages: 2233-2247
Authors:  Hongyu Ding;  Yuanze Tang;  Qing Wu;  Bo Wang;  Chunlin Chen;  Zhi Wang
Adobe PDF(5205Kb)  |  Views/Downloads: 118/39  |  Submitted: 2023/10/31
Dynamic environments  goal-conditioned reinforcement learning  magnetic field  reward shaping  
Intelligent Electric Vehicle Charging Scheduling in Transportation-Energy Nexus With Distributional Reinforcement Learning [Journal article]
IEEE/CAA Journal of Automatica Sinica, 2023, Volume: 10, Issue: 11, Pages: 2171-2173
Authors:  Tao Chen;  Ciwei Gao
Adobe PDF(577Kb)  |  Views/Downloads: 55/25  |  Submitted: 2023/09/22
Privacy Preserving Demand Side Management Method via Multi-Agent Reinforcement Learning [Journal article]
IEEE/CAA Journal of Automatica Sinica, 2023, Volume: 10, Issue: 10, Pages: 1984-1999
Authors:  Feiye Zhang;  Qingyu Yang;  Dou An
Adobe PDF(3841Kb)  |  Views/Downloads: 104/58  |  Submitted: 2023/09/07
Centralized training and decentralized execution  demand side management  multi-agent reinforcement learning  privacy preserving