CASIA OpenIR

浏览/检索结果: 共4条,第1-4条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Large sequence models for sequential decision-making: a survey 期刊论文
FRONTIERS OF COMPUTER SCIENCE, 2023, 卷号: 17, 期号: 6, 页码: 18
作者:  Wen, Muning;  Lin, Runji;  Wang, Hanjing;  Yang, Yaodong;  Wen, Ying;  Mai, Luo;  Wang, Jun;  Zhang, Haifeng;  Zhang, Weinan
Adobe PDF(1351Kb)  |  收藏  |  浏览/下载:163/9  |  提交时间:2023/11/17
sequential decision-making  sequence modeling  the Transformer  training system  
Learning Deep Decentralized Policy Network by Collective Rewards for Real-Time Combat Game 会议论文
, Macao, China, August 10-16, 2019
作者:  Peixi Peng;  Junliang Xing;  Lili Cao;  Lisen Mu;  Chang Huang
Adobe PDF(762Kb)  |  收藏  |  浏览/下载:361/134  |  提交时间:2019/10/10
Multi-agent Learning  Deep Decentralized Policy Network  Real-time Combat Game  
Incremental Codebook Adaptation for Visual Representation and Categorization 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 卷号: 48, 期号: 7, 页码: 2012-2023
作者:  Zhang, Chunjie;  Cheng, Jian;  Tian, Qi
Adobe PDF(2354Kb)  |  收藏  |  浏览/下载:623/259  |  提交时间:2017/09/14
Codebook learning  low-rank  sparse coding  visual representation  
Adding Active Learning to LWR for Ping-Pong Playing Robot 期刊论文
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2013, 卷号: 21, 期号: 4, 页码: 1489-1494
作者:  Huang, Yanlong;  Xu, De;  Tan, Min;  Su, Hu
Adobe PDF(414Kb)  |  收藏  |  浏览/下载:353/114  |  提交时间:2015/08/12
Active Learning  Fuzzy Cerebellar Model Articulation Controller (Fcmac)  Lazy Learning  Locally Weighted Regression (Lwr)  Ping-pong Playing Robot