CASIA OpenIR

Browse/Search results: 37 items in total, showing items 1-10

Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs [Conference paper]
, Australia, 2023-6
Authors: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo
Adobe PDF(7948Kb)  |  Views/Downloads: 37/14  |  Submitted: 2024/06/25
reinforcement learning  hierarchical reinforcement learning
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning [Journal article]
Machine Intelligence Research, 2023, Page: 158
Authors: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu
Adobe PDF(9639Kb)  |  Views/Downloads: 21/9  |  Submitted: 2024/06/25
Multi-objective Deep Reinforcement Learning for Mobile Edge Computing [Conference paper]
, Singapore, 2023/8/24-27
Authors: Yang, Ning; Wen, Junrui; Zhang, Meng; Tang, Ming
Adobe PDF(499Kb)  |  Views/Downloads: 54/18  |  Submitted: 2024/06/05
mobile edge computing  multi-objective reinforcement learning  resource scheduling  
D2AH-PPO: Playing ViZDoom With Object-Aware Hierarchical Reinforcement Learning [Conference paper]
, Chongqing, China, 2024.5.7-5.9
Authors: Niu LY(钮龙宇); Wan J(万军)
Adobe PDF(1645Kb)  |  Views/Downloads: 50/12  |  Submitted: 2024/05/28
deep reinforcement learning  representation learning  hierarchical learning
Spiking Adaptive Dynamic Programming with Poisson Process [Conference paper]
, Qingdao, Shandong Province, China, 2021-07-18
Authors: Wei QL(魏庆来); Han LY(韩立元); Zhang TL(张铁林)
Adobe PDF(2334Kb)  |  Views/Downloads: 54/17  |  Submitted: 2024/05/28
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism [Conference paper]
, Washington, DC, USA, February 7-14, 2023
Authors: Zhiwei Xu; Yunpeng Bai; Bin Zhang; Dapeng Li; Guoliang Fan
Adobe PDF(3345Kb)  |  Views/Downloads: 39/10  |  Submitted: 2024/05/28
Learning to Coordinate via Multiple Graph Neural Networks [Conference paper]
, Bali, Indonesia, December 8-12, 2021
Authors: Zhiwei Xu; Bin Zhang; Yunpeng Bai; Dapeng Li; Guoliang Fan
Adobe PDF(2047Kb)  |  Views/Downloads: 51/21  |  Submitted: 2024/05/28
Second-Order Global Attention Networks for Graph Classification and Regression [Conference paper]
, Beijing, China, August 27-28, 2022
Authors: Hu Fenyu; Cui Zeyu; Wu Shu; Liu Qiang; Wu Jinlin; Wang Liang; Tan Tieniu
Adobe PDF(69424Kb)  |  Views/Downloads: 227/73  |  Submitted: 2023/07/06
Multimodal Global Relation Knowledge Distillation for Egocentric Action Anticipation [Conference paper]
MM '21: Proceedings of the 29th ACM International Conference on Multimedia, Chengdu, China, 2021.10.20-2021.10.24
Authors: Huang Yi; Yang Xiaoshan; Xu Changsheng
Adobe PDF(1162Kb)  |  Views/Downloads: 191/75  |  Submitted: 2023/06/21
Cooperative Multi-Agent Reinforcement Learning with Hypergraph Convolution [Conference paper]
, Padua, Italy, 18-23 July 2022
Authors: Yunpeng Bai; Chen Gong; Bin Zhang; Guoliang Fan; Xinwen Hou; Yu Liu
Adobe PDF(8946Kb)  |  Views/Downloads: 150/40  |  Submitted: 2023/06/14