CASIA OpenIR

浏览/检索结果: 共31条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:11/5  |  提交时间:2024/06/25
强化学习,分层强化学习  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:8/5  |  提交时间:2024/06/25
Improving Generalization of Multi-agent Reinforcement Learning through Domain-Invariant Feature Extraction 会议论文
, Greece, 2023-5
作者:  Xu YF(徐一凡);  Pu ZQ(蒲志强);  Cai QA(蔡奇昂);  Li FM(李非墨);  Chai XH(柴兴华)
Adobe PDF(7610Kb)  |  收藏  |  浏览/下载:8/4  |  提交时间:2024/06/21
M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文
, London, United Kingdom, 2023.5.29-2023.6.2
作者:  Meng Linghui;  Ruan Jingqing;  Xiong Xuantang;  Li Xiyun;  Zhang Xi;  Xing Dengpeng;  Xu Bo
Adobe PDF(1302Kb)  |  收藏  |  浏览/下载:20/5  |  提交时间:2024/06/11
Filtered Observations for Model-Based Multi-agent Reinforcement Learning 会议论文
, Turin, Italy, 2023.9.18-2023.9.22
作者:  Meng Linghui;  Xiong Xuantang;  Zang Yifan;  Zhang Xi;  Li Guoqi;  Xing Dengpeng;  Xu Bo
Adobe PDF(841Kb)  |  收藏  |  浏览/下载:26/10  |  提交时间:2024/06/11
Generative Calibration for In-context Learning 会议论文
, Singapore, 2023-10-6
作者:  Zhongtao Jiang;  Yuanzhe Zhang;  Cao Liu;  Jun Zhao;  Kang Liu
Adobe PDF(763Kb)  |  收藏  |  浏览/下载:21/7  |  提交时间:2024/06/06
Minimizing Age of Information for Mobile Edge Computing Systems: A Nested Index Approach 会议论文
, Singapore, 2023/8/24-27
作者:  Chen,Shuo;  Yang,Ning;  Zhang,Meng;  Wang,Jun
Adobe PDF(1413Kb)  |  收藏  |  浏览/下载:36/9  |  提交时间:2024/06/05
Multi-objective Deep Reinforcement Learning for Mobile Edge Computing 会议论文
, Singapore, 2023/8/24-27
作者:  Yang,Ning;  Wen,Junrui;  Zhang,Meng;  Tang,Ming
Adobe PDF(499Kb)  |  收藏  |  浏览/下载:35/12  |  提交时间:2024/06/05
mobile edge computing  multi-objective reinforcement learning  resource scheduling  
Fault Diagnosis for Robotic Fish Sensors based on Spatial Domain Image Fusion and Convolution Neural Network 会议论文
, Tianjin, China, 2023-7
作者:  Xuqing Fan;  Sai Deng;  Junfeng Fan;  Chao Zhou;  Zhengxing Wu;  Yaming Ou;  Bin Zhang
Adobe PDF(1492Kb)  |  收藏  |  浏览/下载:26/8  |  提交时间:2024/06/05
Fault Diagnosis  GAF Fusion  CNN  Robotic Fish  
Learning Superior Cooperative Policy in Competitive Multi-team Reinforcement Learning 会议论文
, Gold Coast, Australia, 2023-6
作者:  Qingxu Fu;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi;  Xiaolin Ai;  Wanmai Yuan
Adobe PDF(25675Kb)  |  收藏  |  浏览/下载:27/4  |  提交时间:2024/06/05