已选(0)清除
条数/页: 排序方式: |
| Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination 会议论文 , 日本, 2024-6 作者: Zhang Qingyang; Xu Bo Adobe PDF(8862Kb)  |  收藏  |  浏览/下载:7/4  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Soft Contrastive Learning with Q-irrelevance Abstraction for Reinforcement Learning 期刊论文 IEEE Transactions on Cognitive and Developmental Systems, 2023, 卷号: 15, 期号: 3, 页码: 1463 - 1473 作者: Liu MS(刘民颂); Li LT(李伦通); Hao S(郝帅); Zhu YH(朱圆恒); Zhao DB(赵冬斌) Adobe PDF(4197Kb)  |  收藏  |  浏览/下载:10/2  |  提交时间:2024/06/24 |
| 基于视觉表征的深度强化学习方法 学位论文 , 2024 作者: 刘民颂 Adobe PDF(10778Kb)  |  收藏  |  浏览/下载:15/1  |  提交时间:2024/06/22 深度强化学习,视觉表征学习,自监督学习,状态抽象,Transformer神经网络 |
| Improving Generalization of Multi-agent Reinforcement Learning through Domain-Invariant Feature Extraction 会议论文 , Greece, 2023-5 作者: Xu YF(徐一凡); Pu ZQ(蒲志强); Cai QA(蔡奇昂); Li FM(李非墨); Chai XH(柴兴华) Adobe PDF(7610Kb)  |  收藏  |  浏览/下载:7/4  |  提交时间:2024/06/21 |
| 面向多目标覆盖任务的深度强化学习迁移泛化方法研究 学位论文 , 2024 作者: 徐一凡 Adobe PDF(20521Kb)  |  收藏  |  浏览/下载:24/2  |  提交时间:2024/06/20 多目标覆盖任务 强化学习 迁移泛化 课程学习 域自适应 环境偏移 |
| M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文 , London, United Kingdom, 2023.5.29-2023.6.2 作者: Meng Linghui; Ruan Jingqing; Xiong Xuantang; Li Xiyun; Zhang Xi; Xing Dengpeng; Xu Bo Adobe PDF(1302Kb)  |  收藏  |  浏览/下载:18/4  |  提交时间:2024/06/11 |
| Learn to flap: foil non-parametric path planning via deep reinforcement learning 期刊论文 Journal of Fluid Mechanics, 2024, 卷号: 984, 页码: A9 作者: Wang, Zhipeng; Lin, Runji; Zhao, Zhiyu; Chen, Xu; Guo, Pengming; Yang, Ning; Wang,Zhicheng; Fan, Dixia Adobe PDF(1892Kb)  |  收藏  |  浏览/下载:26/3  |  提交时间:2024/06/07 |
| Self-Triggered Set Stabilization of Boolean Control Networks and Its Applications 期刊论文 IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1631-1642 作者: Rong Zhao; Jun-e Feng; Dawei Zhang Adobe PDF(1665Kb)  |  收藏  |  浏览/下载:16/9  |  提交时间:2024/06/07 Boolean control networks (BCNs) output regulation self-triggered control semi-tensor product of matrices set stabilization synchronization |
| Discovering Latent Variables for the Tasks With Confounders in Multi-Agent Reinforcement Learning 期刊论文 IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1591-1604 作者: Kun Jiang; Wenzhang Liu; Yuanda Wang; Lu Dong; Changyin Sun Adobe PDF(2128Kb)  |  收藏  |  浏览/下载:21/7  |  提交时间:2024/06/07 Latent variable model maximum entropy multi-agent reinforcement learning (MARL) multi-agent system |
| Nonlinear Filtering With Sample-Based Approximation Under Constrained Communication: Progress, Insights and Trends 期刊论文 IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1539-1556 作者: Weihao Song; Zidong Wang; Zhongkui Li; Jianan Wang; Qing-Long Han Adobe PDF(1858Kb)  |  收藏  |  浏览/下载:22/7  |  提交时间:2024/06/07 Communication constraints maximum correntropy filter networked nonlinear filtering particle filter sample-based approximation unscented Kalman filter |