| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs. Conference paper, Australia, 2023-6. Authors: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo
Adobe PDF (7948 KB)  |  Views/Downloads: 9/5  |  Submitted: 2024/06/25  |  Keywords: reinforcement learning; hierarchical reinforcement learning |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning. Journal article, Machine Intelligence Research, 2023, Pages: 158. Authors: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu
Adobe PDF (9639 KB)  |  Views/Downloads: 8/5  |  Submitted: 2024/06/25 |
| Learning Robust Communication by Adversarial Training in Networked System Control. Journal article, Lecture Notes in Electrical Engineering, 2024, Pages: Chapter 52 978-981-97-3335-4. Authors: Runji, Lin; Haifeng, Zhang
Adobe PDF (8334 KB)  |  Views/Downloads: 21/6  |  Submitted: 2024/06/11  |  Keywords: Networked System Control; Robustness; Communicative Multi-Agent Reinforcement Learning |
| Multi-objective Deep Reinforcement Learning for Mobile Edge Computing. Conference paper, Singapore, 2023/8/24-27. Authors: Yang, Ning; Wen, Junrui; Zhang, Meng; Tang, Ming
Adobe PDF (499 KB)  |  Views/Downloads: 33/12  |  Submitted: 2024/06/05  |  Keywords: mobile edge computing; multi-objective reinforcement learning; resource scheduling |
| Consensus Learning for Cooperative Multi-Agent Reinforcement Learning. Conference paper, Washington, DC, USA, February 7-14, 2023. Authors: Zhiwei Xu; Bin Zhang; Dapeng Li; Zeren Zhang; Guangchong Zhou; Hao Chen; Guoliang Fan
Adobe PDF (4141 KB)  |  Views/Downloads: 25/8  |  Submitted: 2024/05/28 |
| HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism. Conference paper, Washington, DC, USA, February 7-14, 2023. Authors: Zhiwei Xu; Yunpeng Bai; Bin Zhang; Dapeng Li; Guoliang Fan
Adobe PDF (3345 KB)  |  Views/Downloads: 23/6  |  Submitted: 2024/05/28 |
| A Multi-modal Global Instance Tracking Benchmark (MGIT): Better Locating Target in Complex Spatio-temporal and Causal Relationship. Conference paper, New Orleans, 2023-12. Authors: Shiyu, Hu; Dailing, Zhang; Meiqi, Wu; Xiaokun, Feng; Xuchen, Li; Xin, Zhao; Kaiqi, Huang
Adobe PDF (6215 KB)  |  Views/Downloads: 107/23  |  Submitted: 2024/01/22 |
| AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning. Conference paper, Online, 2022-02-22. Authors: Zhao EM (赵恩民); Yan RY (闫仁业); Li JQ (李金秋); Li K (李凯); Xing JL (兴军亮)
Adobe PDF (2593 KB)  |  Views/Downloads: 192/71  |  Submitted: 2023/06/29 |
| Multi-Granularity Pruning for Model Acceleration on Mobile Devices. Conference paper, Online, 2022-07. Authors: Zhao TL (赵天理); Zhang X (张希); Zhu WT (朱文涛); Wang JX (王家兴); Yang S (杨森); Liu J (刘季); Cheng J (程健)
Adobe PDF (1919 KB)  |  Views/Downloads: 130/52  |  Submitted: 2023/06/21  |  Keywords: Deep Neural Networks; Network Pruning; Structured Pruning; Non-structured Pruning; Single Instruction Multiple Data |
| Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition. Conference paper, Washington D.C., USA, 2023-2-9. Authors: Qingyu Wang; Tielin Zhang; Minglun Han; Yi Wang; Duzhen Zhang; Bo Xu
Adobe PDF (1714 KB)  |  Views/Downloads: 172/50  |  Submitted: 2023/06/20 |