CASIA OpenIR

浏览/检索结果: 共4条,第1-4条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:40/17  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:23/11  |  提交时间:2024/06/25
Learning Robust Communication by Adversarial Training in Networked System Control 期刊论文
Lecture Notes in Electrical Engineering, 2024, 页码: Chapter 52 978-981-97-3335-4
作者:  Runji, Lin;  Haifeng, Zhang
Adobe PDF(8334Kb)  |  收藏  |  浏览/下载:50/18  |  提交时间:2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning  
Compressing Speaker Extraction Model with Ultra-low Precision Quantization and Knowledge Distillation 期刊论文
Neural Networks, 2022, 卷号: 154, 页码: 13-21
作者:  Yating Huang;  Yunzhe Hao;  Jiaming Xu;  Bo Xu
Adobe PDF(801Kb)  |  收藏  |  浏览/下载:241/63  |  提交时间:2022/09/17