CASIA OpenIR

Browse/Search Results: 41 items in total, showing items 1-10

Offline Hierarchical Reinforcement Learning: Enable Large-Scale Training in HRL (Conference paper)
, Nanjing, 2023-11-27
Authors:  Yuqiao Wu;  Haifeng Zhang;  Jun Wang
Adobe PDF (1339Kb)  |  Views/Downloads: 25/8  |  Submitted: 2024/07/12
DRL-Based Adaptive Sharding for Blockchain-Based Federated Learning (Journal article)
IEEE TRANSACTIONS ON COMMUNICATIONS, 2023, Volume: 71, Issue: 10, Pages: 5992-6004
Authors:  Lin, Yijing;  Gao, Zhipeng;  Du, Hongyang;  Kang, Jiawen;  Niyato, Dusit;  Wang, Qian;  Ruan, Jingqing;  Wan, Shaohua
Views/Downloads: 4/0  |  Submitted: 2024/07/03
Keywords: Blockchain sharding; federated learning; reputation; deep reinforcement learning
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning (Journal article)
Machine Intelligence Research, 2023, Pages: 158
Authors:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF (9639Kb)  |  Views/Downloads: 22/10  |  Submitted: 2024/06/25
Improving Generalization of Multi-agent Reinforcement Learning through Domain-Invariant Feature Extraction (Conference paper)
, Greece, 2023-5
Authors:  Xu YF(徐一凡);  Pu ZQ(蒲志强);  Cai QA(蔡奇昂);  Li FM(李非墨);  Chai XH(柴兴华)
Adobe PDF (7610Kb)  |  Views/Downloads: 26/11  |  Submitted: 2024/06/21
P-vectors: A Parallel-Coupled TDNN/Transformer Network for Speaker Verification (Conference paper)
, Dublin, Ireland, 2023.08.24
Authors:  Wang XY(王溪源);  Wang FY(王方圆);  Xu B(徐波);  Xu L(徐亮);  Xiao J(肖京)
Adobe PDF (1542Kb)  |  Views/Downloads: 61/15  |  Submitted: 2024/06/12
Generative Calibration for In-context Learning (Conference paper)
, Singapore, 2023-10-6
Authors:  Zhongtao Jiang;  Yuanzhe Zhang;  Cao Liu;  Jun Zhao;  Kang Liu
Adobe PDF (763Kb)  |  Views/Downloads: 43/20  |  Submitted: 2024/06/06
Interpreting Sentiment Composition with Latent Semantic Tree (Conference paper)
, Toronto, Canada, 2023-7-9
Authors:  Zhongtao Jiang;  Yuanzhe Zhang;  Cao Liu;  Jiansong Chen;  Jun Zhao;  Kang Liu
Adobe PDF (509Kb)  |  Views/Downloads: 49/20  |  Submitted: 2024/06/06
Learning Superior Cooperative Policy in Competitive Multi-team Reinforcement Learning (Conference paper)
, Gold Coast, Australia, 2023-6
Authors:  Qingxu Fu;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi;  Xiaolin Ai;  Wanmai Yuan
Adobe PDF (25675Kb)  |  Views/Downloads: 47/10  |  Submitted: 2024/06/05
Learning Heterogeneous Agent Cooperation via Multiagent League Training (Journal article)
IFAC World Congress, 2023, Pages: IFAC PapersOnLine 56-2 (2023) 3033-3040
Authors:  Qingxu, Fu;  Xiaolin Ai;  Jianqiang Yi;  Tenghai Qiu;  Wanmai Yuan;  Zhiqiang Pu
Adobe PDF (996Kb)  |  Views/Downloads: 44/12  |  Submitted: 2024/06/05
Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning (Conference paper)
Advanced Intelligent Computing Technology and Applications, Zhengzhou, China, 2023-8
Authors:  He SQ(何少钦);  Gao Y(高阳);  Zhang BF(张保丰);  Chang H(常惠);  Zhang XC(张鑫辰)
Adobe PDF (1496Kb)  |  Views/Downloads: 64/21  |  Submitted: 2024/05/31
Keywords: Air Combat; Reinforcement Learning; Neural Fictitious Self-Play