CASIA OpenIR

Browse/search results: 184 items in total, showing items 1-10

Tacit Commitments Emergence in Multi-agent Reinforcement Learning (Conference paper)
New Delhi, India, 2023-7
Authors: Liu BY(刘博寅); Zhiqiang Pu; Junlong Gao; Jianqiang Yi; Zhenyu Guo
Adobe PDF(932Kb) | Views/Downloads: 15/6 | Submitted: 2024/07/15
Improved Self-Propelled Swarms Model with Enhanced Convergence Efficiency (Conference paper)
Tianjin, China, 2020
Authors: Boyin Liu; Zhiqiang Pu; Shiguang Wu; Lele Wang
Adobe PDF(210Kb) | Views/Downloads: 17/7 | Submitted: 2024/07/12
Learning State-Specific Action Masks for Reinforcement Learning (Journal article)
Algorithms, 2024, Volume: 17, Issue: 2, Pages: 60
Authors: Wang ZY(王梓薏); Li XR(李欣然); Sun LY(孙罗洋); Zhang HF(张海峰); Liu HL(刘华林); Jun Wang
Adobe PDF(2976Kb) | Views/Downloads: 35/15 | Submitted: 2024/07/05
reinforcement learning  exploration efficiency  space reduction  
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs (Conference paper)
Australia, 2023-6
Authors: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo
Adobe PDF(7948Kb) | Views/Downloads: 35/14 | Submitted: 2024/06/25
reinforcement learning  hierarchical reinforcement learning
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning (Journal article)
Machine Intelligence Research, 2023, Pages: 158
Authors: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu
Adobe PDF(9639Kb) | Views/Downloads: 20/9 | Submitted: 2024/06/25
Robust Single-particle Cryo-EM Image Denoising and Restoration (Conference paper)
Seoul, Korea, 14-19 April 2024
Authors: Zhang Jing; Tengfei Zhao; ShiYu Hu; Xin Zhao
Adobe PDF(966Kb) | Views/Downloads: 44/12 | Submitted: 2024/06/21
Improving Generalization of Multi-agent Reinforcement Learning through Domain-Invariant Feature Extraction (Conference paper)
Greece, 2023-5
Authors: Xu YF(徐一凡); Pu ZQ(蒲志强); Cai QA(蔡奇昂); Li FM(李非墨); Chai XH(柴兴华)
Adobe PDF(7610Kb) | Views/Downloads: 24/11 | Submitted: 2024/06/21
Learning Robust Communication by Adversarial Training in Networked System Control (Journal article)
Lecture Notes in Electrical Engineering, 2024, Pages: Chapter 52 978-981-97-3335-4
Authors: Runji, Lin; Haifeng, Zhang
Adobe PDF(8334Kb) | Views/Downloads: 45/16 | Submitted: 2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning  
Multi-Scale Dynamic Coding Improved Spiking Actor Network for Reinforcement Learning (Conference paper)
Online, February 22–March 1, 2022
Authors: Zhang, Duzhen; Zhang, Tielin; Jia, Shuncheng; Xu, Bo
Adobe PDF(2249Kb) | Views/Downloads: 37/14 | Submitted: 2024/06/11
Token-level Direct Preference Optimization (Conference paper)
Vienna, Austria, 2024/7/21-27
Authors: Zeng, Yongcheng; Liu, Guoqing; Ma, Weiyu; Yang, Ning; Zhang, Haifeng; Wang, Jun
Adobe PDF(883Kb) | Views/Downloads: 66/22 | Submitted: 2024/06/05