CASIA OpenIR

Browse/Search results: 93 items in total, showing items 1-10

Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning [Conference paper]
Advanced Intelligent Computing Technology and Applications, Zhengzhou, China, 2023-8
Authors: He SQ (何少钦); Gao Y (高阳); Zhang BF (张保丰); Chang H (常惠); Zhang XC (张鑫辰)
Adobe PDF (1496 Kb) | Views/Downloads: 11/4 | Submitted: 2024/05/31
Air Combat, Reinforcement Learning, Neural Fictitious Self-Play.  
Adaptive Multilingual Representations for Cross-Lingual Entity Linking with Attention on Entity Descriptions [Conference paper]
, Hangzhou, China, 2019-8
Authors: Wang, Chenhao; Chen, Yubo; Liu, Kang; Zhao, Jun
Adobe PDF (1745 Kb) | Views/Downloads: 7/1 | Submitted: 2024/05/30
Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning [Conference paper]
, Queensland, Australia, 2023-6
Authors: Yang, Chen; Yang, Guangkai; Chen, Hao; Zhang, Junge
Adobe PDF (3027 Kb) | Views/Downloads: 8/2 | Submitted: 2024/05/29
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL [Conference paper]
, New Orleans, LA, USA, December 10-16, 2023
Authors: Zhiwei Xu; Bin Zhang; Dapeng Li; Guangchong Zhou; Zeren Zhang; Guoliang Fan
Adobe PDF (8700 Kb) | Views/Downloads: 7/0 | Submitted: 2024/05/28
MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning [Conference paper]
, Shenzhen, China, 18-22 July 2021
Authors: Zhiwei Xu; Dapeng Li; Yunpeng Bai; Guoliang Fan
Adobe PDF (3892 Kb) | Views/Downloads: 7/2 | Submitted: 2024/05/28
A Multi-modal Global Instance Tracking Benchmark (MGIT): Better Locating Target in Complex Spatio-temporal and Causal Relationship [Conference paper]
, New Orleans, 2023-12
Authors: Shiyu, Hu; Dailing, Zhang; Meiqi, Wu; Xiaokun, Feng; Xuchen, Li; Xin, Zhao; Kaiqi, Huang
Adobe PDF (6215 Kb) | Views/Downloads: 88/18 | Submitted: 2024/01/22
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction [Conference paper]
Proceedings of the AAAI Conference on Artificial Intelligence, Washington, USA, 2023.02.07 - 2023.02.14
Authors: Bai FS (白丰硕); Zhang HM (张鸿铭); Tao TY (陶天阳); Wu ZH (武志亨); Wang YN (王燕娜); Xu B (徐博)
Adobe PDF (1663 Kb) | Views/Downloads: 175/40 | Submitted: 2023/07/05
Reinforcement Learning Algorithms, Transfer, Domain Adaptation, Multi-Task Learning
A Novel Underwater Image Synthesis Method Based on a Pixel-level Self-Supervised Training Strategy [Conference paper]
, Xining, China, 2021-7
Authors: Zhiheng Wu; Zhengxing Wu; Yue Lu; Jian Wang; Junzhi Yu
Adobe PDF (1862 Kb) | Views/Downloads: 125/45 | Submitted: 2023/06/29
Evolution of opinions with estimation and interference [Conference paper]
Proceedings of the 41st Chinese Control Conference, Hefei, 2022.7.25-27
Authors: Liu, Yifa; Cheng, Long
Adobe PDF (214 Kb) | Views/Downloads: 127/47 | Submitted: 2023/06/28
Opinion dynamics, Self-cognition, Estimation  
Commander-Soldiers Reinforcement Learning for Cooperative Multi-Agent Systems [Conference paper]
, Italy, 2022-7
Authors: Chen YQ (陈逸群); Yang Wei; Tianle Zhang; Shiguang Wu; Hongxing Chang
Adobe PDF (15907 Kb) | Views/Downloads: 141/32 | Submitted: 2023/06/28