已选(0)清除
条数/页: 排序方式: |
| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文 , 澳大利亚, 2023-6 作者: Zhang Qingyang ; Yang Yiming ; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(7948Kb)  |   收藏  |  浏览/下载:7/4  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文 Machine Intelligence Research, 2023, 页码: 158 作者: Zhang Qingyang ; Zhang Hongming; Xing Dengpeng ; Bo Xu![](/image/person.jpg)
Adobe PDF(9639Kb)  |   收藏  |  浏览/下载:7/5  |  提交时间:2024/06/25 |
| Filtered Observations for Model-Based Multi-agent Reinforcement Learning 会议论文 , Turin, Italy, 2023.9.18-2023.9.22 作者: Meng Linghui ; Xiong Xuantang; Zang Yifan; Zhang Xi ; Li Guoqi ; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(841Kb)  |   收藏  |  浏览/下载:21/8  |  提交时间:2024/06/11 |
| Disturbance Observer-Based Predictive Tracking Control of Uncertain HOFA Cyber-Physical Systems 期刊论文 IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1711-1713 作者: Da-Wei Zhang ; Guo-Ping Liu
Adobe PDF(474Kb)  |   收藏  |  浏览/下载:29/15  |  提交时间:2024/06/07 |
| Privacy-Preserving Average Consensus Algorithm Under Round-Robin Scheduling Protocol 期刊论文 IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1705-1707 作者: Yingjiang Guo; Wenying Xu; Haodong Wang; Jianquan Lu; Shengli Du
Adobe PDF(728Kb)  |   收藏  |  浏览/下载:19/9  |  提交时间:2024/06/07 |
| Finite-Time Stabilization for Constrained Discrete-time Systems by Using Model Predictive Control 期刊论文 IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1656-1666 作者: Bing Zhu; Xiaozhuoer Yuan; Li Dai; Zhiwen Qiang
Adobe PDF(1749Kb)  |   收藏  |  浏览/下载:30/14  |  提交时间:2024/06/07 Constraints deadbeat control finite-time stabilization model predictive control (MPC) |
| Ultimately Bounded Output Feedback Control for Networked Nonlinear Systems With Unreliable Communication Channel: A Buffer-Aided Strategy 期刊论文 IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1566-1578 作者: Yuhan Zhang; Zidong Wang; Lei Zou; Yun Chen; Guoping Lu
Adobe PDF(2016Kb)  |   收藏  |  浏览/下载:20/7  |  提交时间:2024/06/07 Buffer-aided strategy neural networks nonlinear control output-feedback control unreliable communication channel |
| Nonlinear Filtering With Sample-Based Approximation Under Constrained Communication: Progress, Insights and Trends 期刊论文 IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1539-1556 作者: Weihao Song; Zidong Wang; Zhongkui Li; Jianan Wang; Qing-Long Han
Adobe PDF(1858Kb)  |   收藏  |  浏览/下载:22/7  |  提交时间:2024/06/07 Communication constraints maximum correntropy filter networked nonlinear filtering particle filter sample-based approximation unscented Kalman filter |
| A cooperation and decision-making framework in dynamic confrontation for multi-agent systems 期刊论文 Computers and Electrical Engineering, 2024, 页码: 118 作者: Lexing Wang; Tenghai Qiu ; Zhiqiang Pu ; Jianqiang Yi![](/image/person.jpg)
Adobe PDF(1302Kb)  |   收藏  |  浏览/下载:22/5  |  提交时间:2024/06/06 |
| 基于脑脉冲序列的离散时间动态系统学习控制研究 学位论文 , 2024 作者: 韩立元![](/image/person.jpg)
Adobe PDF(32282Kb)  |   收藏  |  浏览/下载:25/4  |  提交时间:2024/06/04 离散时间动态系统 脑脉冲序列 脉冲自适应动态规划 脉冲神经网络 多尺度动力学 脑机接口 |