CASIA OpenIR

浏览/检索结果: 共59条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination 会议论文
, 日本, 2024-6
作者:  Zhang Qingyang;  Xu Bo
Adobe PDF(8862Kb)  |  收藏  |  浏览/下载:25/8  |  提交时间:2024/06/25
强化学习,分层强化学习  
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:41/16  |  提交时间:2024/06/25
强化学习,分层强化学习  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:23/11  |  提交时间:2024/06/25
Multi-Scale Dynamic Coding Improved Spiking Actor Network for Reinforcement Learning 会议论文
, Online, February 22–March 1, 2022
作者:  Zhang, Duzhen;  Zhang, Tielin;  Jia, Shuncheng;  Xu, Bo
Adobe PDF(2249Kb)  |  收藏  |  浏览/下载:43/15  |  提交时间:2024/06/11
M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文
, London, United Kingdom, 2023.5.29-2023.6.2
作者:  Meng Linghui;  Ruan Jingqing;  Xiong Xuantang;  Li Xiyun;  Zhang Xi;  Xing Dengpeng;  Xu Bo
Adobe PDF(1302Kb)  |  收藏  |  浏览/下载:37/11  |  提交时间:2024/06/11
Filtered Observations for Model-Based Multi-agent Reinforcement Learning 会议论文
, Turin, Italy, 2023.9.18-2023.9.22
作者:  Meng Linghui;  Xiong Xuantang;  Zang Yifan;  Zhang Xi;  Li Guoqi;  Xing Dengpeng;  Xu Bo
Adobe PDF(841Kb)  |  收藏  |  浏览/下载:47/19  |  提交时间:2024/06/11
A New Pre-Training Paradigm for Offline Multi-Agent Reinforcement Learning with Suboptimal Data 会议论文
, Seoul, Korea, 2024.4.14-2024.4.19
作者:  Meng Linghui;  Zhang Xi;  Xing Dengpeng;  Xu Bo
Adobe PDF(964Kb)  |  收藏  |  浏览/下载:50/20  |  提交时间:2024/06/11
Self-Lateral Propagation Elevates Synaptic Modifications in Spiking Neural Networks for the Efficient Spatial and Temporal Classification 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:  Zhang, Tielin;  Wang, Qingyu;  Xu, Bo
收藏  |  浏览/下载:130/0  |  提交时间:2023/11/17
Self-lateral propagation (SLP)  spatial classifica-tion  spiking neural network (SNN)  synaptic plasticity  temporal classification  
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文
Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14
作者:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:212/50  |  提交时间:2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Feilong Chen;  Duzhen Zhang;  Minglun Han;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(969Kb)  |  收藏  |  浏览/下载:178/35  |  提交时间:2023/06/21