CASIA OpenIR

浏览/检索结果: 共14条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:52/23  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:29/13  |  提交时间:2024/06/25
Learning Robust Communication by Adversarial Training in Networked System Control 期刊论文
Lecture Notes in Electrical Engineering, 2024, 页码: Chapter 52 978-981-97-3335-4
作者:  Runji, Lin;  Haifeng, Zhang
Adobe PDF(8334Kb)  |  收藏  |  浏览/下载:58/22  |  提交时间:2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning  
Learning Heterogeneous Agent Cooperation via Multiagent League Training 期刊论文
IFAC World Congress, 2023, 页码: IFAC PapersOnLine 56-2 (2023) 3033-3040
作者:  Qingxu, Fu;  Xiaolin Ai;  Jianqiang Yi;  Tenghai Qiu;  Wanmai Yuan;  Zhiqiang Pu
Adobe PDF(996Kb)  |  收藏  |  浏览/下载:50/15  |  提交时间:2024/06/05
Learning Playing Piano with Bionic-Constrained Diffusion Policy for Anthropomorphic Hand 期刊论文
Cyborg and Bionic Systems, 2024, 卷号: 5, 页码: 0104
作者:  Yang YM(杨依明);  Wang ZC(王泽昌);  Xing DP(邢登鹏);  Wang P(王鹏)
Adobe PDF(3500Kb)  |  收藏  |  浏览/下载:42/17  |  提交时间:2024/05/30
Human-robot object handover: Recent progress and future direction 期刊论文
Biomimetic Intelligence and Robotics, 2024, 卷号: 4, 页码: 100145
作者:  Duan, Haonan;  Yang, Yifan;  Li, Daheng;  Wang, Peng
Adobe PDF(1839Kb)  |  收藏  |  浏览/下载:54/18  |  提交时间:2024/05/29
Human–robot interactions  Object handover  
Multi-Agent Hierarchical Cognition Difference Policy for Multi-Agent Cooperation 期刊论文
Algorithms, 2021, 期号: 14, 页码: 98
作者:  Huimu Wang;  Zhen Liu;  Jianqiang Yi;  Zhiqiang Pu
Adobe PDF(1155Kb)  |  收藏  |  浏览/下载:279/62  |  提交时间:2021/06/24
multiagent system  deep reinforcement learning  variational autoencoder  attention mechanism  
Robust Object Tracking via Information Theoretic Measures 期刊论文
International Journal of Automation and Computing, 2020, 期号: 17, 页码: 1
作者:  Wang, Weining;  Li, Qi;  Wang, Liang
浏览  |  Adobe PDF(9045Kb)  |  收藏  |  浏览/下载:225/66  |  提交时间:2020/09/03
Object tracking, information theoretic measures, correntropy, template update, robust to complex noises  
数字孪生与平行系统: 发展现状、对比及展望 期刊论文
自动化学报, 2019, 卷号: 45, 期号: 11, 页码: 2001-2031
作者:  杨林瑶;  陈思远;  王晓;  张俊;  王成红
浏览  |  Adobe PDF(13977Kb)  |  收藏  |  浏览/下载:1069/515  |  提交时间:2020/03/17
 数字孪生, 平行系统, 复杂系统管理与控制, 人工智能, 虚实交互  
Parallel Crime Scene Analysis Based on ACP Approach 期刊论文
IEEE Transactions on Computational Social Systems, 2018, 卷号: 5, 期号: 1, 页码: 244-255
作者:  Wang, Shuai;  Wang, Xiao;  Ye, Peijun;  Yuan, Yong;  Liu, Shuo;  Wang, Feiyue
浏览  |  Adobe PDF(2590Kb)  |  收藏  |  浏览/下载:296/89  |  提交时间:2019/11/12
Artificial Societies  Computational Experiments  Parallel Execution  Crime Scene Analysis  Parallel Systems