CASIA OpenIR

浏览/检索结果: 共262条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Lazy Agents: A New Perspective on Solving Sparse Reward Problem in Multi-agent Reinforcement Learning 期刊
创刊日期: 2018,
主办者:  Liu BY(刘博寅)
Adobe PDF(5797Kb)  |  收藏  |  浏览/下载:22/5  |  提交时间:2024/07/12
Learning to Play Football from Sports Perspective: A Knowledge-embedded Deep Reinforcement Learning Framework 期刊论文
IEEE Transactions on Games, 2022, 页码: 12
作者:  Liu BY(刘博寅)
Adobe PDF(2957Kb)  |  收藏  |  浏览/下载:22/5  |  提交时间:2024/07/12
QFuture: Learning Future Expectation Cognition in Multi-Agent Reinforcement Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 12
作者:  Liu BY(刘博寅)
Adobe PDF(6675Kb)  |  收藏  |  浏览/下载:16/2  |  提交时间:2024/07/12
Offline Hierarchical Reinforcement Learning: Enable Large-Scale Training in HRL 会议论文
, Nanjing, 2023-11-27
作者:  Yuqiao Wu;  Haifeng Zhang;  Jun Wang
Adobe PDF(1339Kb)  |  收藏  |  浏览/下载:19/4  |  提交时间:2024/07/12
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:30/12  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
Advancing Spiking Neural Networks Toward Deep Residual Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 页码: 15
作者:  Hu, Yifan;  Deng, Lei;  Wu, Yujie;  Yao, Man;  Li, Guoqi
收藏  |  浏览/下载:5/0  |  提交时间:2024/07/03
Degradation problem  neuromorphic computing  residual neural network  spiking neural network (SNN)  
On the Effects of Structural Modeling for Neural Semantic Parsing 会议论文
Proceedings of the 27th Conference on Computational Natural Language Learning (CoNLL), Singapore, Singapore, 2023-12
作者:  Zhang X(张翔);  He SZ(何世柱);  Liu K(刘康);  Zhao J(赵军)
Adobe PDF(730Kb)  |  收藏  |  浏览/下载:32/19  |  提交时间:2024/06/27
AdaNSP: Uncertainty-driven Adaptive Decoding in Neural Semantic Parsing 会议论文
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy, 2019-07
作者:  Zhang X(张翔);  He SZ(何世柱);  Liu K(刘康);  Zhao J(赵军)
Adobe PDF(400Kb)  |  收藏  |  浏览/下载:28/9  |  提交时间:2024/06/26
Online Optimization of Normalized CPGs for a Multi-Joint Robotic Fish 会议论文
, 中国,上海, 2021年7月
作者:  Tong R(仝茹);  Wu ZX(吴正兴);  Wang J(王健);  Tan M(谭民);  Yu JZ(喻俊志)
Adobe PDF(456Kb)  |  收藏  |  浏览/下载:27/15  |  提交时间:2024/06/26
Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination 会议论文
, 日本, 2024-6
作者:  Zhang Qingyang;  Xu Bo
Adobe PDF(8862Kb)  |  收藏  |  浏览/下载:20/7  |  提交时间:2024/06/25
强化学习,分层强化学习