CASIA OpenIR

Browse/Search results: 360 items in total, showing items 1-10

Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs (Conference paper)
Australia, 2023-6
Authors: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo
Adobe PDF (7948Kb) | Views/Downloads: 7/4 | Submitted: 2024/06/25
Keywords: reinforcement learning; hierarchical reinforcement learning
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning (Journal article)
Machine Intelligence Research, 2023, Pages: 158
Authors: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu
Adobe PDF (9639Kb) | Views/Downloads: 7/5 | Submitted: 2024/06/25
Filtered Observations for Model-Based Multi-agent Reinforcement Learning (Conference paper)
Turin, Italy, 2023.9.18-2023.9.22
Authors: Meng Linghui; Xiong Xuantang; Zang Yifan; Zhang Xi; Li Guoqi; Xing Dengpeng; Xu Bo
Adobe PDF (841Kb) | Views/Downloads: 21/8 | Submitted: 2024/06/11
Disturbance Observer-Based Predictive Tracking Control of Uncertain HOFA Cyber-Physical Systems (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 7, Pages: 1711-1713
Authors: Da-Wei Zhang; Guo-Ping Liu
Adobe PDF (474Kb) | Views/Downloads: 29/15 | Submitted: 2024/06/07
Privacy-Preserving Average Consensus Algorithm Under Round-Robin Scheduling Protocol (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 7, Pages: 1705-1707
Authors: Yingjiang Guo; Wenying Xu; Haodong Wang; Jianquan Lu; Shengli Du
Adobe PDF (728Kb) | Views/Downloads: 19/9 | Submitted: 2024/06/07
Finite-Time Stabilization for Constrained Discrete-time Systems by Using Model Predictive Control (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 7, Pages: 1656-1666
Authors: Bing Zhu; Xiaozhuoer Yuan; Li Dai; Zhiwen Qiang
Adobe PDF (1749Kb) | Views/Downloads: 30/14 | Submitted: 2024/06/07
Keywords: constraints; deadbeat control; finite-time stabilization; model predictive control (MPC)
Ultimately Bounded Output Feedback Control for Networked Nonlinear Systems With Unreliable Communication Channel: A Buffer-Aided Strategy (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 7, Pages: 1566-1578
Authors: Yuhan Zhang; Zidong Wang; Lei Zou; Yun Chen; Guoping Lu
Adobe PDF (2016Kb) | Views/Downloads: 20/7 | Submitted: 2024/06/07
Keywords: buffer-aided strategy; neural networks; nonlinear control; output-feedback control; unreliable communication channel
Nonlinear Filtering With Sample-Based Approximation Under Constrained Communication: Progress, Insights and Trends (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 7, Pages: 1539-1556
Authors: Weihao Song; Zidong Wang; Zhongkui Li; Jianan Wang; Qing-Long Han
Adobe PDF (1858Kb) | Views/Downloads: 22/7 | Submitted: 2024/06/07
Keywords: communication constraints; maximum correntropy filter; networked nonlinear filtering; particle filter; sample-based approximation; unscented Kalman filter
A cooperation and decision-making framework in dynamic confrontation for multi-agent systems (Journal article)
Computers and Electrical Engineering, 2024, Pages: 118
Authors: Lexing Wang; Tenghai Qiu; Zhiqiang Pu; Jianqiang Yi
Adobe PDF (1302Kb) | Views/Downloads: 22/5 | Submitted: 2024/06/06
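Research on Learning Control of Discrete-Time Dynamic Systems Based on Brain Spike Trains (基于脑脉冲序列的离散时间动态系统学习控制研究) (Thesis)
2024
Author: Han Liyuan (韩立元)
Adobe PDF (32282Kb) | Views/Downloads: 25/4 | Submitted: 2024/06/04
Keywords: discrete-time dynamic systems; brain spike trains; spiking adaptive dynamic programming; spiking neural networks; multi-scale dynamics; brain-computer interface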