CASIA OpenIR

浏览/检索结果: 共30条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:35/15  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:35/14  |  提交时间:2024/06/25
强化学习,分层强化学习  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:20/9  |  提交时间:2024/06/25
Multi-Scale Dynamic Coding Improved Spiking Actor Network for Reinforcement Learning 会议论文
, Online, February 22–March 1, 2022
作者:  Zhang, Duzhen;  Zhang, Tielin;  Jia, Shuncheng;  Xu, Bo
Adobe PDF(2249Kb)  |  收藏  |  浏览/下载:37/14  |  提交时间:2024/06/11
Minimizing Age of Information for Mobile Edge Computing Systems: A Nested Index Approach 会议论文
, Singapore, 2023/8/24-27
作者:  Chen,Shuo;  Yang,Ning;  Zhang,Meng;  Wang,Jun
Adobe PDF(1413Kb)  |  收藏  |  浏览/下载:49/10  |  提交时间:2024/06/05
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Yunpeng Bai;  Bin Zhang;  Dapeng Li;  Guoliang Fan
Adobe PDF(3345Kb)  |  收藏  |  浏览/下载:35/9  |  提交时间:2024/05/28
MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Shenzhen, China, 18-22 July 2021
作者:  Zhiwei Xu;  Dapeng Li;  Yunpeng Bai;  Guoliang Fan
Adobe PDF(3892Kb)  |  收藏  |  浏览/下载:21/11  |  提交时间:2024/05/28
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文
Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14
作者:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:206/47  |  提交时间:2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Stealthy false data injection attacks against extended Kalman filter detection in power grids 会议论文
Proceedings of 2021 8th International Conference on Information, Cybernetics, and Computational Social Systems, Beijing, China, 2021.12.10-12
作者:  Liu, Yifa;  Cheng, Long
Adobe PDF(1623Kb)  |  收藏  |  浏览/下载:153/39  |  提交时间:2023/06/28
False data injection, state estimation, extended Kalman filter, attack sequence  
Disturbance Observer Based Control for an Underwater Biomimetic Vehicle-Manipulator System with Mismatched Disturbances 会议论文
, Suzhou, China, 2021.5.14-2021.5.16
作者:  Lv, Jiaqi;  Wang, Yu;  Wang, Shuo;  Cheng, Long;  Tan, Min
Adobe PDF(1182Kb)  |  收藏  |  浏览/下载:136/46  |  提交时间:2023/06/25
Underwater biomimetic vehicle-manipulator system (UBVMS)  disturbance observer  arctangent non-singularity sliding mode controller (ANTSMC)  mismatched disturbances