CASIA OpenIR

Browse/Search Results: 57 items in total, showing items 1-10

Learning Robust Communication by Adversarial Training in Networked System Control (journal article)
Lecture Notes in Electrical Engineering, 2024, Pages: Chapter 52 978-981-97-3335-4
Authors: Runji, Lin; Haifeng, Zhang
Adobe PDF(8334Kb)  |  Views/Downloads: 48/17  |  Submitted: 2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning  
Learning Superior Cooperative Policy in Competitive Multi-team Reinforcement Learning (conference paper)
Gold Coast, Australia, 2023-6
Authors: Qingxu Fu; Tenghai Qiu; Zhiqiang Pu; Jianqiang Yi; Xiaolin Ai; Wanmai Yuan
Adobe PDF(25675Kb)  |  Views/Downloads: 47/10  |  Submitted: 2024/06/05
Continuous Exploration via Multiple Perspectives in Sparse Reward Environment (conference paper)
Xiamen International Conference Center, 2023-10-13
Authors: Chen ZP(陈忠鹏); Guan Q(关强)
Adobe PDF(2260Kb)  |  Views/Downloads: 39/12  |  Submitted: 2024/06/04
Reinforcement Learning · Exploration Strategy · Sparse Reward · Intrinsic Motivation  
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning (conference paper)
New Orleans, LA, USA, November 28 - December 9, 2022
Authors: Zhiwei Xu; Dapeng Li; Bin Zhang; Yuan Zhan; Yunpeng Bai; Guoliang Fan
Adobe PDF(4367Kb)  |  Views/Downloads: 38/7  |  Submitted: 2024/05/28
A Performance Optimization Strategy Based on Improved NSGA-II for a Flexible Robotic Fish (conference paper)
London, UK, 2023.5.29
Authors: Lu, Ben; Wang, Jian; Liao, Xiaocun; Zou, Qianqian; Tan, Min; Zhou, Chao
Adobe PDF(1449Kb)  |  Views/Downloads: 69/19  |  Submitted: 2024/05/28
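Weight-Adaptive Generative Adversarial Imitation Learning Based on Noise Contrastive Estimation (基于噪声对比估计的权重自适应对抗生成式模仿学习) (journal article)
Pattern Recognition and Artificial Intelligence (模式识别与人工智能), 2023, Volume: 36, Issue: 4, Pages: 300-312
Authors: Guan Weifan(关伟凡); Zhang Xi(张希)
Adobe PDF(1849Kb)  |  Views/Downloads: 144/49  |  Submitted: 2023/06/29
Reinforcement Learning  Imitation Learning  Noise Contrastive Estimation  Adaptive Weight  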
Energy Based Optimal Dynamic Stealth False Data Injection Attacks on the Smart Grid (conference paper)
Proceedings of 2020 7th International Conference on Information, Cybernetics, and Computational Social Systems, Guangzhou, China, 2020.11.13-15
Authors: Liu, Yifa; Cheng, Long
Adobe PDF(1191Kb)  |  Views/Downloads: 149/51  |  Submitted: 2023/06/29
smart grid, security, false data injection attack, optimal control  
Potential Driven Reinforcement Learning for Hard Exploration Tasks (conference paper)
Online, 2020-4
Authors: Zhao EM(赵恩民); Deng SH(邓诗弘); Zang YF(臧一凡); Kang YX(康永欣); Li K(李凯); Xing JL(兴军亮)
Adobe PDF(1999Kb)  |  Views/Downloads: 124/46  |  Submitted: 2023/06/29
Evolution of opinions with estimation and interference (conference paper)
Proceedings of 41st Chinese Control Conference, Hefei, 2022.7.25-27
Authors: Liu, Yifa; Cheng, Long
Adobe PDF(214Kb)  |  Views/Downloads: 161/56  |  Submitted: 2023/06/28
Opinion dynamics, Self-cognition, Estimation  
Pseudo Value Network Distillation for High-Performance Exploration (conference paper)
Australia, 2023-06
Authors: Zhao EM(赵恩民); Xing JL(兴军亮); Li K(李凯); Kang YX(康永欣); Tao P(陶品)
Adobe PDF(5874Kb)  |  Views/Downloads: 161/45  |  Submitted: 2023/06/28