CASIA OpenIR

浏览/检索结果: 共354条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
QFuture: Learning Future Expectation Cognition in Multi-Agent Reinforcement Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 12
作者:  Liu BY(刘博寅)
Adobe PDF(6675Kb)  |  收藏  |  浏览/下载:16/2  |  提交时间:2024/07/12
Offline Hierarchical Reinforcement Learning: Enable Large-Scale Training in HRL 会议论文
, Nanjing, 2023-11-27
作者:  Yuqiao Wu;  Haifeng Zhang;  Jun Wang
Adobe PDF(1339Kb)  |  收藏  |  浏览/下载:19/4  |  提交时间:2024/07/12
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:33/15  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
Network Group Partition and Core Placement Optimization for Neuromorphic Multi-Core and Multi-Chip Systems 期刊论文
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 页码: 16
作者:  Yang, Yukuan;  Fan, Qihang;  Yan, Tianyi;  Pei, Jing;  Li, Guoqi
收藏  |  浏览/下载:8/0  |  提交时间:2024/07/03
Multicore processing  Optimization  System recovery  Throughput  Neuromorphics  Hardware  Costs  Network group partition  core placement optimization  neuromorphic chips  multi-core and multi-chip systems  
DRL-Based Adaptive Sharding for Blockchain-Based Federated Learning 期刊论文
IEEE TRANSACTIONS ON COMMUNICATIONS, 2023, 卷号: 71, 期号: 10, 页码: 5992-6004
作者:  Lin, Yijing;  Gao, Zhipeng;  Du, Hongyang;  Kang, Jiawen;  Niyato, Dusit;  Wang, Qian;  Ruan, Jingqing;  Wan, Shaohua
收藏  |  浏览/下载:3/0  |  提交时间:2024/07/03
Blockchain sharding  federated learning  reputation  deep reinforcement learning  
Dynamic datasets and market environments for financial reinforcement learning 期刊论文
MACHINE LEARNING, 2024, 页码: 45
作者:  Liu, Xiao-Yang;  Xia, Ziyi;  Yang, Hongyang;  Gao, Jiechao;  Zha, Daochen;  Zhu, Ming;  Wang, Christina Dan;  Wang, Zhaoran;  Guo, Jian
收藏  |  浏览/下载:4/0  |  提交时间:2024/07/03
Financial reinforcement learning  FinRL  Dynamic dataset  Market environment  AI4Finance  Open finance  
SELF-SUPERVISED MATCHING NETWORK BASED ON FREQUENCY DOMAIN INFORMATION GUIDANCE FOR REMOTE SENSING IMAGE REGISTRATION 会议论文
, Athens, Greece, Jul 7, 2024 - Jul 12, 2024
作者:  Zhou YX(周雨欣);  Wan L(万玲);  Ma L(马雷)
Adobe PDF(11699Kb)  |  收藏  |  浏览/下载:15/6  |  提交时间:2024/07/01
Online Optimization of Normalized CPGs for a Multi-Joint Robotic Fish 会议论文
, 中国,上海, 2021年7月
作者:  Tong R(仝茹);  Wu ZX(吴正兴);  Wang J(王健);  Tan M(谭民);  Yu JZ(喻俊志)
Adobe PDF(456Kb)  |  收藏  |  浏览/下载:27/15  |  提交时间:2024/06/26
Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination 会议论文
, 日本, 2024-6
作者:  Zhang Qingyang;  Xu Bo
Adobe PDF(8862Kb)  |  收藏  |  浏览/下载:20/7  |  提交时间:2024/06/25
强化学习,分层强化学习  
MULFE: A Multi-Level Benchmark for Free Text Model Editing 会议论文
, Bangkok, Thailand, 2024-08
作者:  Wang, Chenhao;  Cao, Pengfei;  Jin, Zhuoran;  Chen, Yubo;  Zeng, Daojian;  Liu, Kang;  Zhao, Jun
Adobe PDF(571Kb)  |  收藏  |  浏览/下载:19/8  |  提交时间:2024/06/25