CASIA OpenIR

浏览/检索结果: 共129条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:24/12  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
Online Optimization of Normalized CPGs for a Multi-Joint Robotic Fish 会议论文
, 中国,上海, 2021年7月
作者:  Tong R(仝茹);  Wu ZX(吴正兴);  Wang J(王健);  Tan M(谭民);  Yu JZ(喻俊志)
Adobe PDF(456Kb)  |  收藏  |  浏览/下载:18/10  |  提交时间:2024/06/26
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:30/12  |  提交时间:2024/06/25
强化学习,分层强化学习  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:16/9  |  提交时间:2024/06/25
MULFE: A Multi-Level Benchmark for Free Text Model Editing 会议论文
, Bangkok, Thailand, 2024-08
作者:  Wang, Chenhao;  Cao, Pengfei;  Jin, Zhuoran;  Chen, Yubo;  Zeng, Daojian;  Liu, Kang;  Zhao, Jun
Adobe PDF(571Kb)  |  收藏  |  浏览/下载:16/7  |  提交时间:2024/06/25
Hitch-Hiking Motion of Multiple Bionic Robotic Remoras with Enhanced Multimodal Locomotion 期刊论文
IEEE-ASME Transactions on Mechatronics, 2024, 页码: 1-11
作者:  Wu, Zhengxing;  Yu, Lianyi;  Wang, Jian;  Dai, Shijie;  Tan, Min;  Yu, Junzhi
Adobe PDF(4893Kb)  |  收藏  |  浏览/下载:43/22  |  提交时间:2024/06/24
Learning Robust Communication by Adversarial Training in Networked System Control 期刊论文
Lecture Notes in Electrical Engineering, 2024, 页码: Chapter 52 978-981-97-3335-4
作者:  Runji, Lin;  Haifeng, Zhang
Adobe PDF(8334Kb)  |  收藏  |  浏览/下载:35/12  |  提交时间:2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning  
Multi-Scale Dynamic Coding Improved Spiking Actor Network for Reinforcement Learning 会议论文
, Online, February 22–March 1, 2022
作者:  Zhang, Duzhen;  Zhang, Tielin;  Jia, Shuncheng;  Xu, Bo
Adobe PDF(2249Kb)  |  收藏  |  浏览/下载:34/13  |  提交时间:2024/06/11
Learning in bi-level markov games 会议论文
, Padua, Italy, 2022.7.18-2022.7.23
作者:  Meng Linghui;  Ruan Jingqing;  Xing Dengpeng;  Xu Bo
Adobe PDF(1450Kb)  |  收藏  |  浏览/下载:38/14  |  提交时间:2024/06/11
A New Pre-Training Paradigm for Offline Multi-Agent Reinforcement Learning with Suboptimal Data 会议论文
, Seoul, Korea, 2024.4.14-2024.4.19
作者:  Meng Linghui;  Zhang Xi;  Xing Dengpeng;  Xu Bo
Adobe PDF(964Kb)  |  收藏  |  浏览/下载:38/14  |  提交时间:2024/06/11