CASIA OpenIR

浏览/检索结果: 共195条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination 会议论文
, 日本, 2024-6
作者:  Zhang Qingyang;  Xu Bo
Adobe PDF(8862Kb)  |  收藏  |  浏览/下载:7/4  |  提交时间:2024/06/25
强化学习,分层强化学习  
Soft Contrastive Learning with Q-irrelevance Abstraction for Reinforcement Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2023, 卷号: 15, 期号: 3, 页码: 1463 - 1473
作者:  Liu MS(刘民颂);  Li LT(李伦通);  Hao S(郝帅);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(4197Kb)  |  收藏  |  浏览/下载:10/2  |  提交时间:2024/06/24
基于视觉表征的深度强化学习方法 学位论文
, 2024
作者:  刘民颂
Adobe PDF(10778Kb)  |  收藏  |  浏览/下载:15/1  |  提交时间:2024/06/22
深度强化学习,视觉表征学习,自监督学习,状态抽象,Transformer神经网络  
Improving Generalization of Multi-agent Reinforcement Learning through Domain-Invariant Feature Extraction 会议论文
, Greece, 2023-5
作者:  Xu YF(徐一凡);  Pu ZQ(蒲志强);  Cai QA(蔡奇昂);  Li FM(李非墨);  Chai XH(柴兴华)
Adobe PDF(7610Kb)  |  收藏  |  浏览/下载:7/4  |  提交时间:2024/06/21
面向多目标覆盖任务的深度强化学习迁移泛化方法研究 学位论文
, 2024
作者:  徐一凡
Adobe PDF(20521Kb)  |  收藏  |  浏览/下载:24/2  |  提交时间:2024/06/20
多目标覆盖任务  强化学习  迁移泛化  课程学习  域自适应  环境偏移  
M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文
, London, United Kingdom, 2023.5.29-2023.6.2
作者:  Meng Linghui;  Ruan Jingqing;  Xiong Xuantang;  Li Xiyun;  Zhang Xi;  Xing Dengpeng;  Xu Bo
Adobe PDF(1302Kb)  |  收藏  |  浏览/下载:18/4  |  提交时间:2024/06/11
Learn to flap: foil non-parametric path planning via deep reinforcement learning 期刊论文
Journal of Fluid Mechanics, 2024, 卷号: 984, 页码: A9
作者:  Wang, Zhipeng;  Lin, Runji;  Zhao, Zhiyu;  Chen, Xu;  Guo, Pengming;  Yang, Ning;  Wang,Zhicheng;  Fan, Dixia
Adobe PDF(1892Kb)  |  收藏  |  浏览/下载:26/3  |  提交时间:2024/06/07
Self-Triggered Set Stabilization of Boolean Control Networks and Its Applications 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1631-1642
作者:  Rong Zhao;  Jun-e Feng;  Dawei Zhang
Adobe PDF(1665Kb)  |  收藏  |  浏览/下载:16/9  |  提交时间:2024/06/07
Boolean control networks (BCNs)  output regulation  self-triggered control  semi-tensor product of matrices  set stabilization  synchronization  
Discovering Latent Variables for the Tasks With Confounders in Multi-Agent Reinforcement Learning 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1591-1604
作者:  Kun Jiang;  Wenzhang Liu;  Yuanda Wang;  Lu Dong;  Changyin Sun
Adobe PDF(2128Kb)  |  收藏  |  浏览/下载:21/7  |  提交时间:2024/06/07
Latent variable model  maximum entropy  multi-agent reinforcement learning (MARL)  multi-agent system  
Nonlinear Filtering With Sample-Based Approximation Under Constrained Communication: Progress, Insights and Trends 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1539-1556
作者:  Weihao Song;  Zidong Wang;  Zhongkui Li;  Jianan Wang;  Qing-Long Han
Adobe PDF(1858Kb)  |  收藏  |  浏览/下载:22/7  |  提交时间:2024/06/07
Communication constraints  maximum correntropy filter  networked nonlinear filtering  particle filter  sample-based approximation  unscented Kalman filter