已选(0)清除
条数/页: 排序方式: |
| Offline Hierarchical Reinforcement Learning: Enable Large-Scale Training in HRL 会议论文 , Nanjing, 2023-11-27 作者: Yuqiao Wu ; Haifeng Zhang; Jun Wang
Adobe PDF(1339Kb)  |   收藏  |  浏览/下载:25/8  |  提交时间:2024/07/12 |
| DRL-Based Adaptive Sharding for Blockchain-Based Federated Learning 期刊论文 IEEE TRANSACTIONS ON COMMUNICATIONS, 2023, 卷号: 71, 期号: 10, 页码: 5992-6004 作者: Lin, Yijing; Gao, Zhipeng; Du, Hongyang; Kang, Jiawen; Niyato, Dusit; Wang, Qian ; Ruan, Jingqing; Wan, Shaohua
![](/themes/default/image/downing1.png) 收藏  |  浏览/下载:4/0  |  提交时间:2024/07/03 Blockchain sharding federated learning reputation deep reinforcement learning |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文 Machine Intelligence Research, 2023, 页码: 158 作者: Zhang Qingyang ; Zhang Hongming; Xing Dengpeng ; Bo Xu![](/image/person.jpg)
Adobe PDF(9639Kb)  |   收藏  |  浏览/下载:22/10  |  提交时间:2024/06/25 |
| Improving Generalization of Multi-agent Reinforcement Learning through Domain-Invariant Feature Extraction 会议论文 , Greece, 2023-5 作者: Xu YF(徐一凡) ; Pu ZQ(蒲志强) ; Cai QA(蔡奇昂) ; Li FM(李非墨) ; Chai XH(柴兴华)
Adobe PDF(7610Kb)  |   收藏  |  浏览/下载:26/11  |  提交时间:2024/06/21 |
| P-vectors: A Parallel-Coupled TDNN/Transformer Network for Speaker Verification 会议论文 , Dublin, Ireland, 2023.08.24 作者: Wang XY(王溪源) ; Wang FY(王方圆) ; Xu B(徐波) ; Xu L(徐亮); Xiao J(肖京)
Adobe PDF(1542Kb)  |   收藏  |  浏览/下载:61/15  |  提交时间:2024/06/12 |
| Generative Calibration for In-context Learning 会议论文 , Singapore, 2023-10-6 作者: Zhongtao Jiang ; Yuanzhe Zhang ; Cao Liu ; Jun Zhao ; Kang Liu
Adobe PDF(763Kb)  |   收藏  |  浏览/下载:43/20  |  提交时间:2024/06/06 |
| Interpreting Sentiment Composition with Latent Semantic Tree 会议论文 , Toronto, Canada, 2023-7-9 作者: Zhongtao Jiang ; Yuanzhe Zhang ; Cao Liu ; Jiansong Chen; Jun Zhao ; Kang Liu
Adobe PDF(509Kb)  |   收藏  |  浏览/下载:49/20  |  提交时间:2024/06/06 |
| Learning Superior Cooperative Policy in Competitive Multi-team Reinforcement Learning 会议论文 , Gold Coast, Australia, 2023-6 作者: Qingxu Fu ; Tenghai Qiu ; Zhiqiang Pu ; Jianqiang Yi ; Xiaolin Ai ; Wanmai Yuan
Adobe PDF(25675Kb)  |   收藏  |  浏览/下载:47/10  |  提交时间:2024/06/05 |
| Learning Heterogeneous Agent Cooperation via Multiagent League Training 期刊论文 IFAC World Congress, 2023, 页码: IFAC PapersOnLine 56-2 (2023) 3033-3040 作者: Qingxu, Fu ; Xiaolin Ai ; Jianqiang Yi ; Tenghai Qiu ; Wanmai Yuan; Zhiqiang Pu![](/image/person.jpg)
Adobe PDF(996Kb)  |   收藏  |  浏览/下载:44/12  |  提交时间:2024/06/05 |
| Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning 会议论文 Advanced Intelligent Computing Technology and Applications, 中国郑州, 2023-8 作者: He SQ(何少钦) ; Gao Y(高阳) ; Zhang BF(张保丰); Chang H(常惠) ; Zhang XC(张鑫辰)
Adobe PDF(1496Kb)  |   收藏  |  浏览/下载:64/21  |  提交时间:2024/05/31 Air Combat, Reinforcement Learning, Neural Fictitious Self-Play. |