CASIA OpenIR

浏览/检索结果: 共15条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
On the Effects of Structural Modeling for Neural Semantic Parsing 会议论文
Proceedings of the 27th Conference on Computational Natural Language Learning (CoNLL), Singapore, Singapore, 2023-12
作者:  Zhang X(张翔);  He SZ(何世柱);  Liu K(刘康);  Zhao J(赵军)
Adobe PDF(730Kb)  |  收藏  |  浏览/下载:44/23  |  提交时间:2024/06/27
Learning Heterogeneous Agent Cooperation via Multiagent League Training 期刊论文
IFAC World Congress, 2023, 页码: IFAC PapersOnLine 56-2 (2023) 3033-3040
作者:  Qingxu, Fu;  Xiaolin Ai;  Jianqiang Yi;  Tenghai Qiu;  Wanmai Yuan;  Zhiqiang Pu
Adobe PDF(996Kb)  |  收藏  |  浏览/下载:49/14  |  提交时间:2024/06/05
Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning 会议论文
Advanced Intelligent Computing Technology and Applications, 中国郑州, 2023-8
作者:  He SQ(何少钦);  Gao Y(高阳);  Zhang BF(张保丰);  Chang H(常惠);  Zhang XC(张鑫辰)
Adobe PDF(1496Kb)  |  收藏  |  浏览/下载:70/23  |  提交时间:2024/05/31
Air Combat, Reinforcement Learning, Neural Fictitious Self-Play.  
Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning 会议论文
, Queensland, Australia, 2023-6
作者:  Yang, Chen;  Yang, Guangkai;  Chen, Hao;  Zhang, Junge
Adobe PDF(3027Kb)  |  收藏  |  浏览/下载:66/23  |  提交时间:2024/05/29
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文
, New Orleans, LA, USA, December 10-16, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Guangchong Zhou;  Zeren Zhang;  Guoliang Fan
Adobe PDF(8700Kb)  |  收藏  |  浏览/下载:59/17  |  提交时间:2024/05/28
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Zeren Zhang;  Guangchong Zhou;  Hao Chen;  Guoliang Fan
Adobe PDF(4141Kb)  |  收藏  |  浏览/下载:53/21  |  提交时间:2024/05/28
Learning to Play Football From Sports Domain Perspective: A Knowledge-Embedded Deep Reinforcement Learning Framework 期刊论文
IEEE TRANSACTIONS ON GAMES, 2023, 卷号: 15, 期号: 4, 页码: 648-657
作者:  Liu, Boyin;  Pu, Zhiqiang;  Zhang, Tianle;  Wang, Huimu;  Yi, Jianqiang;  Mi, Jiachen
收藏  |  浏览/下载:74/0  |  提交时间:2024/02/22
Deformable convolution  football analysis  pitch control  reinforcement learning  
SOTVerse: A User-Defined Task Space of Single Object Tracking 期刊论文
International Journal of Computer Vision, 2023, 卷号: 132, 期号: 3, 页码: 1-59
作者:  Shiyu, Hu;  Xin, Zhao;  Kaiqi Huang
Adobe PDF(53048Kb)  |  收藏  |  浏览/下载:95/11  |  提交时间:2024/01/22
Single object tracking  Experimental environment  Evaluation system  Performance analysis  
Large sequence models for sequential decision-making: a survey 期刊论文
FRONTIERS OF COMPUTER SCIENCE, 2023, 卷号: 17, 期号: 6, 页码: 18
作者:  Wen, Muning;  Lin, Runji;  Wang, Hanjing;  Yang, Yaodong;  Wen, Ying;  Mai, Luo;  Wang, Jun;  Zhang, Haifeng;  Zhang, Weinan
Adobe PDF(1351Kb)  |  收藏  |  浏览/下载:162/9  |  提交时间:2023/11/17
sequential decision-making  sequence modeling  the Transformer  training system  
A Torque Control Strategy for a Robotic Dolphin Platform Based on Angle of Attack Feedback 期刊论文
Biomimetics, 2023, 卷号: 8, 页码: 291
作者:  Tianzhu Wang;  Junzhi Yu;  Di Chen;  Yan Meng
Adobe PDF(1925Kb)  |  收藏  |  浏览/下载:182/68  |  提交时间:2023/09/21
robotic dolphin  torque control  angle of attack  motion improvement