CASIA OpenIR

浏览/检索结果: 共4条,第1-4条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Offline Hierarchical Reinforcement Learning: Enable Large-Scale Training in HRL 会议论文
, Nanjing, 2023-11-27
作者:  Yuqiao Wu;  Haifeng Zhang;  Jun Wang
Adobe PDF(1339Kb)  |  收藏  |  浏览/下载:33/10  |  提交时间:2024/07/12
基于门限和环签名的抗自适应攻击拜占庭容错共识算法 期刊论文
自动化学报, 2023, 卷号: 49, 期号: 7, 页码: 1471-1482
作者:  孙海锋;  张文芳;  王小敏;  马征;  黄路非;  李暄
Adobe PDF(2182Kb)  |  收藏  |  浏览/下载:72/24  |  提交时间:2024/04/25
区块链  拜占庭容错  共识算法  自适应攻击  环签名  门限签名  
Offline Pre-trained Multi-agent Decision Transformer 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 2, 页码: 233-248
作者:  Linghui Meng;  Muning Wen;  Chenyang Le;  Xiyun Li;  Dengpeng Xing;  Weinan Zhang;  Ying Wen;  Haifeng Zhang;  Jun Wang;  Yaodong Yang;  Bo Xu
Adobe PDF(2121Kb)  |  收藏  |  浏览/下载:72/17  |  提交时间:2024/04/23
Pre-training model  multi-agent reinforcement learning (MARL)  decision making  transformer  offline reinforcement learning  
Large sequence models for sequential decision-making: a survey 期刊论文
FRONTIERS OF COMPUTER SCIENCE, 2023, 卷号: 17, 期号: 6, 页码: 18
作者:  Wen, Muning;  Lin, Runji;  Wang, Hanjing;  Yang, Yaodong;  Wen, Ying;  Mai, Luo;  Wang, Jun;  Zhang, Haifeng;  Zhang, Weinan
Adobe PDF(1351Kb)  |  收藏  |  浏览/下载:164/9  |  提交时间:2023/11/17
sequential decision-making  sequence modeling  the Transformer  training system