已选(0)清除
条数/页: 排序方式: |
| Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination 会议论文 , 日本, 2024-6 作者: Zhang Qingyang; Xu Bo Adobe PDF(8862Kb)  |  收藏  |  浏览/下载:25/8  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文 , 澳大利亚, 2023-6 作者: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:41/16  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文 Machine Intelligence Research, 2023, 页码: 158 作者: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:23/11  |  提交时间:2024/06/25 |
| Multi-Scale Dynamic Coding Improved Spiking Actor Network for Reinforcement Learning 会议论文 , Online, February 22–March 1, 2022 作者: Zhang, Duzhen; Zhang, Tielin; Jia, Shuncheng; Xu, Bo Adobe PDF(2249Kb)  |  收藏  |  浏览/下载:43/15  |  提交时间:2024/06/11 |
| M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文 , London, United Kingdom, 2023.5.29-2023.6.2 作者: Meng Linghui; Ruan Jingqing; Xiong Xuantang; Li Xiyun; Zhang Xi; Xing Dengpeng; Xu Bo Adobe PDF(1302Kb)  |  收藏  |  浏览/下载:37/11  |  提交时间:2024/06/11 |
| Filtered Observations for Model-Based Multi-agent Reinforcement Learning 会议论文 , Turin, Italy, 2023.9.18-2023.9.22 作者: Meng Linghui; Xiong Xuantang; Zang Yifan; Zhang Xi; Li Guoqi; Xing Dengpeng; Xu Bo Adobe PDF(841Kb)  |  收藏  |  浏览/下载:47/19  |  提交时间:2024/06/11 |
| A New Pre-Training Paradigm for Offline Multi-Agent Reinforcement Learning with Suboptimal Data 会议论文 , Seoul, Korea, 2024.4.14-2024.4.19 作者: Meng Linghui; Zhang Xi; Xing Dengpeng; Xu Bo Adobe PDF(964Kb)  |  收藏  |  浏览/下载:50/20  |  提交时间:2024/06/11 |
| Self-Lateral Propagation Elevates Synaptic Modifications in Spiking Neural Networks for the Efficient Spatial and Temporal Classification 期刊论文 IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13 作者: Zhang, Tielin; Wang, Qingyu; Xu, Bo 收藏  |  浏览/下载:130/0  |  提交时间:2023/11/17 Self-lateral propagation (SLP) spatial classifica-tion spiking neural network (SNN) synaptic plasticity temporal classification |
| PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文 Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14 作者: Bai FS(白丰硕); Zhang HM(张鸿铭); Tao TY(陶天阳); Wu ZH(武志亨); Wang YN(王燕娜); Xu B(徐博) Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:212/50  |  提交时间:2023/07/05 Reinforcement Learning Algorithms Transfer Domain Adaptation Multi-Task Learning |
| VLP: A Survey on Vision-language Pre-training 期刊论文 Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56 作者: Feilong Chen; Duzhen Zhang; Minglun Han; Xiuyi Chen; Jing Shi; Shuang Xu; Bo Xu Adobe PDF(969Kb)  |  收藏  |  浏览/下载:178/35  |  提交时间:2023/06/21 |