CASIA OpenIR

浏览/检索结果: 共796条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Learning Top-K Subtask Planning Tree Based on Discriminative Representation Pretraining for Decision-making 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 4, 页码: 782-800
作者:  Jingqing Ruan;   Kaishen Wang;   Qingyang Zhang;   Dengpeng Xing;   Bo Xu
Adobe PDF(4577Kb)  |  收藏  |  浏览/下载:27/12  |  提交时间:2024/07/18
Reinforcement learning  representation learning  subtask planning  task decomposition  pretraining.  
Privacy Protection for Blockchain-Based Healthcare IoT Systems: A Survey 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 8, 页码: 1757-1776
作者:  Minfeng Qi;  Ziyuan Wang;  Qing-Long Han;  Jun Zhang;  Shiping Chen;  Yang Xiang
Adobe PDF(2394Kb)  |  收藏  |  浏览/下载:19/8  |  提交时间:2024/07/16
Blockchain  internet of healthcare things (IoHT)  privacy-preserving techniques (PPTs)  
Tacit Commitments Emergence in Multi-agent Reinforcement Learning 会议论文
, New Delhi, India, 2023-7
作者:  Liu BY(刘博寅);  Zhiqiang Pu;  Junlong Gao;  Jianqiang Yi;  Zhenyu Guo
Adobe PDF(932Kb)  |  收藏  |  浏览/下载:28/10  |  提交时间:2024/07/15
Lazy Agents: A New Perspective on Solving Sparse Reward Problem in Multi-agent Reinforcement Learning 期刊
创刊日期: 2018,
主办者:  Liu BY(刘博寅)
Adobe PDF(5797Kb)  |  收藏  |  浏览/下载:31/8  |  提交时间:2024/07/12
QFuture: Learning Future Expectation Cognition in Multi-Agent Reinforcement Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 12
作者:  Liu BY(刘博寅)
Adobe PDF(6675Kb)  |  收藏  |  浏览/下载:29/5  |  提交时间:2024/07/12
基于深度强化学习的足球智能体球员策略方法研究 学位论文
, 2024
作者:  刘博寅
Adobe PDF(11380Kb)  |  收藏  |  浏览/下载:60/0  |  提交时间:2024/07/12
足球  多智能体系统  深度强化学习  互信息  内在激励  预训练  
Offline Hierarchical Reinforcement Learning: Enable Large-Scale Training in HRL 会议论文
, Nanjing, 2023-11-27
作者:  Yuqiao Wu;  Haifeng Zhang;  Jun Wang
Adobe PDF(1339Kb)  |  收藏  |  浏览/下载:33/10  |  提交时间:2024/07/12
Autonomous Driving in Underground Mines via Parallel Driving Operation Systems: Challenges, Frameworks and Cases Study 期刊论文
IEEE Transactions on Intelligent Vehicles, 2024, 页码: 1-10
作者:  Bin Tian;  Caiji Zhang;  Xuedi Hao;  Shi Meng;  Shibin Wang;  Zheng Yang;  Long Chen;  Yanlong Zhao;  Shirong Ge
Adobe PDF(11335Kb)  |  收藏  |  浏览/下载:64/9  |  提交时间:2024/07/05
NeuronsMAE: A Novel Multi-Agent Reinforcement Learning Environment for Cooperative and Competitive Multi-Robot Tasks 会议论文
, Queensland, Australia, 2023-6
作者:  Hu GZ(胡光政);  Li HR(李浩然);  Liu SS(刘莎莎);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(2785Kb)  |  收藏  |  浏览/下载:41/11  |  提交时间:2024/07/04
基于强化学习动作空间精简的时序决策任务算法研究 学位论文
, 2024
作者:  王梓薏
Adobe PDF(7273Kb)  |  收藏  |  浏览/下载:44/1  |  提交时间:2024/07/04
时序决策  强化学习  动作空间约简  分层强化学习  动作掩码