CASIA OpenIR

浏览/检索结果: 共44条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文
Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14
作者:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:161/37  |  提交时间:2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Bag of Tricks for Training Data Extraction from Language Models 会议论文
, Hawaii, US, 2023-7
作者:  Yu, Weichen;  Pang, Tianyu;  Liu, Qian;  Du, Chao;  Kang, Bingyi;  Huang, Yan;  Yan, Shuicheng
Adobe PDF(2549Kb)  |  收藏  |  浏览/下载:164/68  |  提交时间:2023/07/03
Pseudo Value Network Distillation for High-Performance Exploration 会议论文
, 澳大利亚, 2023-06
作者:  Zhao EM(赵恩民);  Xing JL(兴军亮);  Li K(李凯);  Kang YX(康永欣);  Tao P(陶品)
Adobe PDF(5874Kb)  |  收藏  |  浏览/下载:132/38  |  提交时间:2023/06/28
Knowledge Transfer from Situation Evaluation to Multi-agent Reinforcement Learning 会议论文
, New Delhi, India, 2022年11月22-2022年11月26
作者:  Chen M(陈敏);  Pu ZQ(蒲志强);  Pan Y(潘一);  Yi JQ(易建强)
Adobe PDF(4734Kb)  |  收藏  |  浏览/下载:137/49  |  提交时间:2023/06/27
Multi-agent reinforcement learning  Transfer learning  
All for Goals: a Stylized Automated Analysis Framework in Football Matches 会议论文
, Gold Coast Convention and Exhibition Centre Queensland, Australia, June 18 - 23, 2023
作者:  Chen M(陈敏);  Pu ZQ(蒲志强);  Pan Y(潘一);  Yi JQ(易建强);  Cui YX(崔一雄);  Lida Du
Adobe PDF(1485Kb)  |  收藏  |  浏览/下载:247/159  |  提交时间:2023/06/28
Self-Adaptive Task Allocation for Decentralized Deep Learning in Heterogeneous Environments 会议论文
Proceedings of the International Conference on Software Engineering and Knowledge Engineering, SEKE, 线上会议, 2022-7-1至20227-10
作者:  Chao, Yongyue;  Liao, Mingxue;  Gao, Jiaxin;  Li,Guangyao
Adobe PDF(1976Kb)  |  收藏  |  浏览/下载:109/41  |  提交时间:2023/06/19
Second-Order Global Attention Networks for Graph Classification and Regression 会议论文
, Beijing, China, August 27-28, 2022
作者:  Hu Fenyu;  Cui Zeyu;  Wu Shu;  Liu Qiang;  Wu Jinlin;  Wang Liang;  Tan Tieniu
Adobe PDF(69424Kb)  |  收藏  |  浏览/下载:176/67  |  提交时间:2023/07/06
A time-series augmentation method based on empirical mode decomposition and integrated LSTM neural network 会议论文
, Glasgow, 2022-07
作者:  chenguang li;  hongjun yang;  long cheng
Adobe PDF(1725Kb)  |  收藏  |  浏览/下载:79/32  |  提交时间:2023/06/25
DPNAS: Neural Architecture Search for Deep Learning with Differential Privacy 会议论文
, 线上, 2022-2
作者:  Cheng AD(程安达);  Wang JX(王家兴);  Zhang X(张希);  Chen Q(谌强);  Wang PS(王培松);  Cheng J(程健)
Adobe PDF(1135Kb)  |  收藏  |  浏览/下载:88/21  |  提交时间:2023/06/05
Differentially Private Federated Learning with Local Regularization and Sparsification 会议论文
, 线上, 2022-6
作者:  Cheng AD(程安达);  Wang PS(王培松);  Zhang X(张希);  Cheng J(程健)
Adobe PDF(312Kb)  |  收藏  |  浏览/下载:108/42  |  提交时间:2023/06/05