CASIA OpenIR

浏览/检索结果: 共539条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:6/3  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
Network Group Partition and Core Placement Optimization for Neuromorphic Multi-Core and Multi-Chip Systems 期刊论文
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 页码: 16
作者:  Yang, Yukuan;  Fan, Qihang;  Yan, Tianyi;  Pei, Jing;  Li, Guoqi
收藏  |  浏览/下载:1/0  |  提交时间:2024/07/03
Multicore processing  Optimization  System recovery  Throughput  Neuromorphics  Hardware  Costs  Network group partition  core placement optimization  neuromorphic chips  multi-core and multi-chip systems  
Dynamic datasets and market environments for financial reinforcement learning 期刊论文
MACHINE LEARNING, 2024, 页码: 45
作者:  Liu, Xiao-Yang;  Xia, Ziyi;  Yang, Hongyang;  Gao, Jiechao;  Zha, Daochen;  Zhu, Ming;  Wang, Christina Dan;  Wang, Zhaoran;  Guo, Jian
收藏  |  浏览/下载:0/0  |  提交时间:2024/07/03
Financial reinforcement learning  FinRL  Dynamic dataset  Market environment  AI4Finance  Open finance  
UNSUPERVISED LEARNING OF NEURAL SEMANTIC MAPPINGS WITH THE HUNGARIAN ALGORITHM FOR COMPOSITIONAL SEMANTICS 会议论文
, Seoul, South Korea, 2024-04
作者:  Zhang X(张翔);  He SZ(何世柱);  Liu K(刘康);  Zhao J(赵军)
Adobe PDF(294Kb)  |  收藏  |  浏览/下载:12/8  |  提交时间:2024/06/27
On the Effects of Structural Modeling for Neural Semantic Parsing 会议论文
Proceedings of the 27th Conference on Computational Natural Language Learning (CoNLL), Singapore, Singapore, 2023-12
作者:  Zhang X(张翔);  He SZ(何世柱);  Liu K(刘康);  Zhao J(赵军)
Adobe PDF(730Kb)  |  收藏  |  浏览/下载:12/7  |  提交时间:2024/06/27
AdaNSP: Uncertainty-driven Adaptive Decoding in Neural Semantic Parsing 会议论文
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy, 2019-07
作者:  Zhang X(张翔);  He SZ(何世柱);  Liu K(刘康);  Zhao J(赵军)
Adobe PDF(400Kb)  |  收藏  |  浏览/下载:19/6  |  提交时间:2024/06/26
Online Optimization of Normalized CPGs for a Multi-Joint Robotic Fish 会议论文
, 中国,上海, 2021年7月
作者:  Tong R(仝茹);  Wu ZX(吴正兴);  Wang J(王健);  Tan M(谭民);  Yu JZ(喻俊志)
Adobe PDF(456Kb)  |  收藏  |  浏览/下载:12/6  |  提交时间:2024/06/26
Visual Pencil: Design of Portable Human-Computer Interaction Based on 2D Visual Tracking 会议论文
, 北京, 2020年10月
作者:  Tong R(仝茹);  Wang TZ(王天柱);  Yu JZ(喻俊志)
Adobe PDF(185Kb)  |  收藏  |  浏览/下载:9/2  |  提交时间:2024/06/26
Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination 会议论文
, 日本, 2024-6
作者:  Zhang Qingyang;  Xu Bo
Adobe PDF(8862Kb)  |  收藏  |  浏览/下载:9/4  |  提交时间:2024/06/25
强化学习,分层强化学习  
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:14/5  |  提交时间:2024/06/25
强化学习,分层强化学习