CASIA OpenIR

浏览/检索结果: 共11条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Efficient Exploration for Multi-Agent Reinforcement Learning via Transferable Successor Features 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2022, 卷号: 9, 期号: 9, 页码: 1673-1686
作者:  Wenzhang Liu;  Lu Dong;  Dan Niu;  Changyin Sun
Adobe PDF(5554Kb)  |  收藏  |  浏览/下载:181/78  |  提交时间:2022/08/19
Knowledge transfer  multi-agent systems  reinforcement learning  successor features  
Reinforcement Learning Behavioral Control for Nonlinear Autonomous System 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2022, 卷号: 9, 期号: 9, 页码: 1561-1573
作者:  Zhenyi Zhang;  Zhibin Mo;  Yutao Chen;  Jie Huang
Adobe PDF(2249Kb)  |  收藏  |  浏览/下载:170/47  |  提交时间:2022/08/19
Behavioral control  mission supervisor  nonlinear autonomous system  reinforcement learning  
Visuals to Text: A Comprehensive Review on Automatic Image Captioning 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2022, 卷号: 9, 期号: 8, 页码: 1339-1365
作者:  Yue Ming;  Nannan Hu;  Chunxiao Fan;  Fan Feng;  Jiangwan Zhou;  Hui Yu
Adobe PDF(56128Kb)  |  收藏  |  浏览/下载:185/23  |  提交时间:2022/08/01
Artificial intelligence  attention mechanism  encoder-decoder framework  image captioning  multi-modal understanding  training strategies  
Discounted Iterative Adaptive Critic Designs With Novel Stability Analysis for Tracking Control 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2022, 卷号: 9, 期号: 7, 页码: 1262-1272
作者:  Mingming Ha;  Ding Wang;  Derong Liu
Adobe PDF(1832Kb)  |  收藏  |  浏览/下载:254/85  |  提交时间:2022/06/27
Adaptive critic design  adaptive dynamic programming (ADP)  approximate dynamic programming  discrete-time nonlinear systems  reinforcement learning  stability analysis  tracking control  value iteration (VI)  
Towards Long Lifetime Battery: AI-Based Manufacturing and Management 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2022, 卷号: 9, 期号: 7, 页码: 1139-1165
作者:  Kailong Liu;  Zhongbao Wei;  Chenghui Zhang;  Yunlong Shang;  Remus Teodorescu;  Qing-Long Han
Adobe PDF(10020Kb)  |  收藏  |  浏览/下载:204/37  |  提交时间:2022/06/27
Artificial intelligence  battery health management  battery life diagnostic  battery manufacturing  smart battery  
Cooperative and Competitive Multi-Agent Systems: From Optimization to Games 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2022, 卷号: 9, 期号: 5, 页码: 763-783
作者:  Jianrui Wang;  Yitian Hong;  Jiali Wang;  Jiapeng Xu;  Yang Tang;  Qing-Long Han;  Jürgen Kurths
Adobe PDF(8407Kb)  |  收藏  |  浏览/下载:235/63  |  提交时间:2022/04/24
Cooperative games  counterfactual regret min- imization  distributed optimization  federated optimization  fictitious self-play  mean field games  multi-agent reinforcement learning  non-cooperative games  
Optimal Synchronization Control of Heterogeneous Asymmetric Input-Constrained Unknown Nonlinear MASs via Reinforcement Learning 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2022, 卷号: 9, 期号: 3, 页码: 520-532
作者:  Lina Xia;  Qing Li;  Ruizhuo Song;  Hamidreza Modares
Adobe PDF(2201Kb)  |  收藏  |  浏览/下载:252/59  |  提交时间:2022/03/09
Asymmetric input-constrained  heterogeneous nonlinear multiagent systems (MASs)  Hamilton-Jacobi-Bellman (HJB) equation  novel observer  reinforcement learning (RL)  
Conflict-Aware Safe Reinforcement Learning: A Meta-Cognitive Learning Framework 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2022, 卷号: 9, 期号: 3, 页码: 466-481
作者:  Majid Mazouchi;  Subramanya Nageshrao;  Hamidreza Modares
Adobe PDF(11662Kb)  |  收藏  |  浏览/下载:171/20  |  提交时间:2022/03/09
Optimal control  receding-horizon attentional controller (RHAC)  reinforcement learning (RL)  
Cyber Security Intrusion Detection for Agriculture 4.0: Machine Learning-Based Solutions, Datasets, and Future Directions 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2022, 卷号: 9, 期号: 3, 页码: 407-436
作者:  Mohamed Amine Ferrag;  Lei Shu;  Othmane Friha;  Xing Yang
Adobe PDF(23596Kb)  |  收藏  |  浏览/下载:180/16  |  提交时间:2022/03/09
Agriculture 4.0  cyber security  intrusion detection system  machine learning approaches  smart agriculture  
Data-Driven Human-Robot Interaction Without Velocity Measurement Using Off-Policy Reinforcement Learning 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2022, 卷号: 9, 期号: 1, 页码: 47-63
作者:  Yongliang Yang;  Zihao Ding;  Rui Wang;  Hamidreza Modares;  Donald C. Wunsch
Adobe PDF(15362Kb)  |  收藏  |  浏览/下载:261/50  |  提交时间:2021/11/03
Adaptive impedance control  data-driven method  human-robot interaction (HRI)  reinforcement learning  velocity-free