CASIA OpenIR

浏览/检索结果: 共81条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Learning Top-K Subtask Planning Tree Based on Discriminative Representation Pretraining for Decision-making 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 4, 页码: 782-800
作者:  Jingqing Ruan;   Kaishen Wang;   Qingyang Zhang;   Dengpeng Xing;   Bo Xu
Adobe PDF(4577Kb)  |  收藏  |  浏览/下载:21/9  |  提交时间:2024/07/18
Reinforcement learning  representation learning  subtask planning  task decomposition  pretraining.  
Novel Adaptive Memory Event-Triggered-Based Fuzzy Robust Control for Nonlinear Networked Systems via the Differential Evolution Algorithm 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 8, 页码: 1836-1848
作者:  Wei Qian;  Yanmin Wu;  Bo Shen
Adobe PDF(2197Kb)  |  收藏  |  浏览/下载:32/12  |  提交时间:2024/07/16
Adaptive memory event-triggered (AMET)  differential evolution algorithm  fuzzy optimization robust control  interval type-2 (IT2) fuzzy technique  
Deep Reinforcement Learning or Lyapunov Analysis? A Preliminary Comparative Study on Event-Triggered Optimal Control 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1702-1704
作者:  Jingwei Lu;  Lefei Li;  Qinglai Wei;  Fei-Yue Wang
Adobe PDF(501Kb)  |  收藏  |  浏览/下载:63/18  |  提交时间:2024/06/07
Discovering Latent Variables for the Tasks With Confounders in Multi-Agent Reinforcement Learning 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1591-1604
作者:  Kun Jiang;  Wenzhang Liu;  Yuanda Wang;  Lu Dong;  Changyin Sun
Adobe PDF(2128Kb)  |  收藏  |  浏览/下载:48/18  |  提交时间:2024/06/07
Latent variable model  maximum entropy  multi-agent reinforcement learning (MARL)  multi-agent system  
Interpolated Bumpless Transfer Control for Asynchronously Switched Linear Systems 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1579-1590
作者:  Shengao Lu;  Tong Wu;  Lixian Zhang;  Jianan Yang;  Ye Liang
Adobe PDF(2718Kb)  |  收藏  |  浏览/下载:40/15  |  提交时间:2024/06/07
Asynchronous switching  bumpless transfer  H control  switched systems  
Ultimately Bounded Output Feedback Control for Networked Nonlinear Systems With Unreliable Communication Channel: A Buffer-Aided Strategy 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1566-1578
作者:  Yuhan Zhang;  Zidong Wang;  Lei Zou;  Yun Chen;  Guoping Lu
Adobe PDF(2016Kb)  |  收藏  |  浏览/下载:38/13  |  提交时间:2024/06/07
Buffer-aided strategy  neural networks  nonlinear control  output-feedback control  unreliable communication channel  
An Empirical Study on Google Research Football Multi-agent Scenarios 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 3, 页码: 549-570
作者:  Yan Song;  He Jiang;  Zheng Tian;  Haifeng Zhang;  Yingping Zhang;  Jiangcheng Zhu;  Zonghong Dai;  Weinan Zhang;  Jun Wang
Adobe PDF(24588Kb)  |  收藏  |  浏览/下载:63/19  |  提交时间:2024/05/23
Multi-agent reinforcement learning (RL), distributed RL system, population-based training, reward shaping, game theory  
Distributed Deep Reinforcement Learning: A Survey and a Multi-player Multi-agent Learning Toolbox 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 3, 页码: 411-430
作者:  Qiyue Yin;  Tongtong Yu;  Shengqi Shen;  Jun Yang;  Meijing Zhao;  Wancheng Ni;  Kaiqi Huang;  Bin Liang;  Liang Wang
Adobe PDF(2923Kb)  |  收藏  |  浏览/下载:54/21  |  提交时间:2024/05/23
Deep reinforcement learning, distributed machine learning, self-play, population-play, toolbox  
基于折扣广义值迭代的智能最优跟踪及应用验证 期刊论文
自动化学报, 2022, 卷号: 48, 期号: 1, 页码: 182-193
作者:  王鼎;  赵明明;  哈明鸣;  乔俊飞
Adobe PDF(2055Kb)  |  收藏  |  浏览/下载:46/15  |  提交时间:2024/05/20
自适应评判控制  可容许性  广义值迭代  智能最优跟踪  神经网络  
迭代学习模型预测控制研究现状与挑战 期刊论文
自动化学报, 2022, 卷号: 48, 期号: 6, 页码: 1385-1401
作者:  马乐乐;  刘向杰;  高福荣
Adobe PDF(1566Kb)  |  收藏  |  浏览/下载:30/15  |  提交时间:2024/05/20
迭代学习模型预测控制  二维预测模型  控制律迭代优化  复杂非线性系统  快速系统  变工况