CASIA OpenIR

Browse/search results: 38 records in total, showing records 1-10

Learning to Play Football from Sports Perspective: A Knowledge-embedded Deep Reinforcement Learning Framework Journal article
IEEE Transactions on Games, 2022, Pages: 12
Authors: Liu BY(刘博寅)
Adobe PDF(2957Kb)  |  Views/Downloads: 31/6  |  Submitted: 2024/07/12
Learning Robust Communication by Adversarial Training in Networked System Control Journal article
Lecture Notes in Electrical Engineering, 2024, Pages: Chapter 52 978-981-97-3335-4
Authors: Runji, Lin;  Haifeng, Zhang
Adobe PDF(8334Kb)  |  Views/Downloads: 48/17  |  Submitted: 2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning  
Fuzzy Feedback Multi-Agent Reinforcement Learning for Adversarial Dynamic Multi-Team Competitions Journal article
IEEE Transactions on Fuzzy Systems, 2024, Pages: 1
Authors: Qingxu Fu;  Zhiqiang Pu;  Yi Pan;  Tenghai Qiu;  Jianqiang Yi
Adobe PDF(4975Kb)  |  Views/Downloads: 45/14  |  Submitted: 2024/06/05
Towards Better Quantity Representations for Solving Math Word Problems Journal article
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 2024, Pages: -
Authors: Sun, Runxin;  He, Shizhu;  Zhao, Jun;  Liu, Kang
Adobe PDF(417Kb)  |  Views/Downloads: 53/20  |  Submitted: 2024/05/28
Large sequence models for sequential decision-making: a survey Journal article
FRONTIERS OF COMPUTER SCIENCE, 2023, Volume: 17, Issue: 6, Pages: 18
Authors: Wen, Muning;  Lin, Runji;  Wang, Hanjing;  Yang, Yaodong;  Wen, Ying;  Mai, Luo;  Wang, Jun;  Zhang, Haifeng;  Zhang, Weinan
Adobe PDF(1351Kb)  |  Views/Downloads: 154/5  |  Submitted: 2023/11/17
sequential decision-making  sequence modeling  the Transformer  training system  
RI-LIO: Reflectivity Image Assisted Tightly-Coupled LiDAR-Inertial Odometry Journal article
IEEE Robotics and Automation Letters, 2023, Volume: 8, Issue: 3, Pages: 1802-1809
Authors: Yanfeng Zhang;  Yunong Tian;  Wanguo Wang;  Guodong Yang;  Zhishuo Li;  Fengshui Jing;  Min Tan
Adobe PDF(7657Kb)  |  Views/Downloads: 134/6  |  Submitted: 2023/04/27
Policy decision of curling in real competition scenes Journal article
COMPLEX & INTELLIGENT SYSTEMS, 2022, Pages: 12
Authors: Xiao, Qian;  Li, Zongmin;  Wang, Xiangdong;  Liu, Yujie;  Li, Yachuan;  Yang, Chaozhi;  Li, Feimo
Views/Downloads: 412/0  |  Submitted: 2023/02/22
Reinforcement learning  Curling policy decision  Game tree search  Deep learning  
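A Meta-Policy Evolutionary Learning Algorithm for Opponent Adaptation in Two-Player Zero-Sum Games (一种用于两人零和博弈对手适应的元策略演化学习算法) Journal article
Acta Automatica Sinica (自动化学报), 2022, Pages: 0
Authors: 吴哲;  李凯;  徐航;  兴军亮
Adobe PDF(15953Kb)  |  Views/Downloads: 246/66  |  Submitted: 2022/06/17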
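A Local Observation Reconstruction Method for Ad-Hoc Cooperation (面向Ad-Hoc协作的局部观测重建方法) Journal article
Journal of University of Chinese Academy of Sciences (中国科学院大学学报), 2022, Pages: 1
Authors: 陈皓;  杨立昆;  尹奇跃;  黄凯奇
Adobe PDF(1491Kb)  |  Views/Downloads: 260/53  |  Submitted: 2022/06/16
Multi-agent  Deep reinforcement learning  Credit assignment  Ad-Hoc cooperation  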
Attention Enhanced Reinforcement Learning for Multi agent Cooperation Journal article
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Pages: 15
Authors: Pu, Zhiqiang;  Wang, Huimu;  Liu, Zhen;  Yi, Jianqiang;  Wu, Shiguang
Adobe PDF(2967Kb)  |  Views/Downloads: 366/56  |  Submitted: 2022/06/06
Training  Reinforcement learning  Games  Scalability  Task analysis  Standards  Optimization  Attention mechanism  deep reinforcement learning (DRL)  graph convolutional networks  multi agent systems