CASIA OpenIR

Browse/Search results: 34 items in total, showing items 1-10

Learning State-Specific Action Masks for Reinforcement Learning (Journal article)
Algorithms, 2024, Volume: 17, Issue: 2, Pages: 60
Authors: Wang ZY(王梓薏); Li XR(李欣然); Sun LY(孙罗洋); Zhang HF(张海峰); Liu HL(刘华林); Jun Wang
Adobe PDF(2976Kb) | Views/Downloads: 49/22 | Submitted: 2024/07/05
reinforcement learning  exploration efficiency  space reduction  
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs (Conference paper)
, Australia, 2023-6
Authors: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo
Adobe PDF(7948Kb) | Views/Downloads: 45/17 | Submitted: 2024/06/25
reinforcement learning  hierarchical reinforcement learning  
Robust Single-particle Cryo-EM Image Denoising and Restoration (Conference paper)
, Seoul, Korea, 14-19 April 2024
Authors: Zhang Jing; Tengfei Zhao; ShiYu Hu; Xin Zhao
Adobe PDF(966Kb) | Views/Downloads: 54/15 | Submitted: 2024/06/21
Controller Design and Stability Analysis for Spinning Missile Via Tensor Product (Journal article)
Aerospace Science and Technology, 2022, Pages: 107877
Authors: Zhiming Zhou; Zhen Liu; Yi Pan; Jianqiang Yi
Adobe PDF(1047Kb) | Views/Downloads: 57/21 | Submitted: 2024/06/20
Minimizing Age of Information for Mobile Edge Computing Systems: A Nested Index Approach (Conference paper)
, Singapore, 2023/8/24-27
Authors: Chen, Shuo; Yang, Ning; Zhang, Meng; Wang, Jun
Adobe PDF(1413Kb) | Views/Downloads: 61/14 | Submitted: 2024/06/05
Multi-objective Deep Reinforcement Learning for Mobile Edge Computing (Conference paper)
, Singapore, 2023/8/24-27
Authors: Yang, Ning; Wen, Junrui; Zhang, Meng; Tang, Ming
Adobe PDF(499Kb) | Views/Downloads: 62/21 | Submitted: 2024/06/05
mobile edge computing  multi-objective reinforcement learning  resource scheduling  
Accelerate Dense Matrix Multiplication on Heterogeneous-GPUs (Conference paper)
, Ocean Flower Island, Hainan, China, 2023-12
Authors: Sun, Jianan; Liao, Mingxue; Chao, Yongyue; Lv, Pin
Adobe PDF(261Kb) | Views/Downloads: 52/26 | Submitted: 2024/05/28
Large sequence models for sequential decision-making: a survey (Journal article)
FRONTIERS OF COMPUTER SCIENCE, 2023, Volume: 17, Issue: 6, Pages: 18
Authors: Wen, Muning; Lin, Runji; Wang, Hanjing; Yang, Yaodong; Wen, Ying; Mai, Luo; Wang, Jun; Zhang, Haifeng; Zhang, Weinan
Adobe PDF(1351Kb) | Views/Downloads: 163/9 | Submitted: 2023/11/17
sequential decision-making  sequence modeling  the Transformer  training system  
A Localization and Trajectory Planning Method for UAVs with Visual-Inertial Odometry (Conference paper)
, Sapporo, Japan, 2022-7-11
Authors: Xu WB(徐文博); Lin ZY(林子越); Wang W(王伟)
Adobe PDF(4576Kb) | Views/Downloads: 92/36 | Submitted: 2023/09/12
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction (Conference paper)
Proceedings of the AAAI Conference on Artificial Intelligence, Washington, USA, 2023.02.07 - 2023.02.14
Authors: Bai FS(白丰硕); Zhang HM(张鸿铭); Tao TY(陶天阳); Wu ZH(武志亨); Wang YN(王燕娜); Xu B(徐博)
Adobe PDF(1663Kb) | Views/Downloads: 217/53 | Submitted: 2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning