CASIA OpenIR

浏览/检索结果: 共12条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:42/18  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
Learning Robust Communication by Adversarial Training in Networked System Control 期刊论文
Lecture Notes in Electrical Engineering, 2024, 页码: Chapter 52 978-981-97-3335-4
作者:  Runji, Lin;  Haifeng, Zhang
Adobe PDF(8334Kb)  |  收藏  |  浏览/下载:50/18  |  提交时间:2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning  
Learning Heterogeneous Agent Cooperation via Multiagent League Training 期刊论文
IFAC World Congress, 2023, 页码: IFAC PapersOnLine 56-2 (2023) 3033-3040
作者:  Qingxu, Fu;  Xiaolin Ai;  Jianqiang Yi;  Tenghai Qiu;  Wanmai Yuan;  Zhiqiang Pu
Adobe PDF(996Kb)  |  收藏  |  浏览/下载:46/13  |  提交时间:2024/06/05
Human-robot object handover: Recent progress and future direction 期刊论文
Biomimetic Intelligence and Robotics, 2024, 卷号: 4, 页码: 100145
作者:  Duan, Haonan;  Yang, Yifan;  Li, Daheng;  Wang, Peng
Adobe PDF(1839Kb)  |  收藏  |  浏览/下载:49/17  |  提交时间:2024/05/29
Human–robot interactions  Object handover  
Optimal Strategy for Aircraft Pursuit-Evasion Games via Self-Play Iteration 期刊论文
Machine Intelligence Research, 2023, 页码: 1-12
作者:  Wang Xin;  Wei Qinglai;  Li Tao;  Zhang Jie
Adobe PDF(1556Kb)  |  收藏  |  浏览/下载:216/83  |  提交时间:2023/06/26
知识和数据协同驱动的群体智能决策方法研究综述 期刊论文
自动化学报, 2022, 卷号: 48, 期号: 3, 页码: 1-17
作者:  蒲志强;  易建强;  刘振;  丘腾海;  孙金林;  李非墨
Adobe PDF(1352Kb)  |  收藏  |  浏览/下载:355/87  |  提交时间:2022/04/02
群体智能  知识与数据协同  多智能体  决策智能  
Multi-Agent Hierarchical Cognition Difference Policy for Multi-Agent Cooperation 期刊论文
Algorithms, 2021, 期号: 14, 页码: 98
作者:  Huimu Wang;  Zhen Liu;  Jianqiang Yi;  Zhiqiang Pu
Adobe PDF(1155Kb)  |  收藏  |  浏览/下载:273/57  |  提交时间:2021/06/24
multiagent system  deep reinforcement learning  variational autoencoder  attention mechanism  
一种针对德州扑克AI的对手建模与策略集成框架 期刊论文
自动化学报, 2021, 期号: 0, 页码: 0
作者:  张蒙;  李凯;  吴哲;  臧一凡;  徐航;  兴军亮
Adobe PDF(1354Kb)  |  收藏  |  浏览/下载:443/125  |  提交时间:2021/06/21
不完美信息博弈  德州扑克  演化学习  在线对手建模  种群策略集成  
Parallel Crime Scene Analysis Based on ACP Approach 期刊论文
IEEE Transactions on Computational Social Systems, 2018, 卷号: 5, 期号: 1, 页码: 244-255
作者:  Wang, Shuai;  Wang, Xiao;  Ye, Peijun;  Yuan, Yong;  Liu, Shuo;  Wang, Feiyue
浏览  |  Adobe PDF(2590Kb)  |  收藏  |  浏览/下载:293/88  |  提交时间:2019/11/12
Artificial Societies  Computational Experiments  Parallel Execution  Crime Scene Analysis  Parallel Systems  
Generative Adversarial Networks: Introduction and Outlook 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2017, 卷号: 4, 期号: 4, 页码: 588-598
作者:  Kunfeng Wang;  Chao Gou;  Yanjie Duan;  Yilun Lin;  Xinhu Zheng;  Fei-Yue Wang
浏览  |  Adobe PDF(16945Kb)  |  收藏  |  浏览/下载:386/47  |  提交时间:2018/01/08
Acp Approach  Adversarial Learning  Generative Adversarial Networks (Gans)  Generative Models  Parallel Intelligence  Zero-sum Game