CASIA OpenIR

浏览/检索结果: 共107条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
面向兵棋推演的多智能体智能博弈决策算法研究 学位论文
, 2023
作者:  余照科
Adobe PDF(15273Kb)  |  收藏  |  浏览/下载:679/34  |  提交时间:2023/01/31
请输入关兵棋,智能决策,多智能体,深度强化学习,分布式训练键词  
A Multi-modal Global Instance Tracking Benchmark (MGIT): Better Locating Target in Complex Spatio-temporal and causal Relationship 会议论文
, New Orleans, 2023-12
作者:  Shiyu, Hu;  Dailing, Zhang;  Meiqi, Wu;  Xiaokun, Feng;  Xuchen, Li;  Xin, Zhao;  Kaiqi, Huang
Adobe PDF(6215Kb)  |  收藏  |  浏览/下载:62/12  |  提交时间:2024/01/22
Learning to Play Football From Sports Domain Perspective: A Knowledge-Embedded Deep Reinforcement Learning Framework 期刊论文
IEEE TRANSACTIONS ON GAMES, 2023, 卷号: 15, 期号: 4, 页码: 648-657
作者:  Liu, Boyin;  Pu, Zhiqiang;  Zhang, Tianle;  Wang, Huimu;  Yi, Jianqiang;  Mi, Jiachen
收藏  |  浏览/下载:18/0  |  提交时间:2024/02/22
Deformable convolution  football analysis  pitch control  reinforcement learning  
Peer Incentive Reinforcement Learning for Cooperative Multiagent Games 期刊论文
IEEE TRANSACTIONS ON GAMES, 2023, 卷号: 15, 期号: 4, 页码: 623-636
作者:  Zhang, Tianle;  Liu, Zhen;  Pu, Zhiqiang;  Yi, Jianqiang
收藏  |  浏览/下载:15/0  |  提交时间:2024/02/22
Cooperative multiagent games  intrinsic reward  multiagent reinforcement learning (MARL)  Starcraft II Micromanagement  
二人零和动态博弈的自学习平行控制方法研究 学位论文
, 2023
作者:  朱振华
Adobe PDF(1737Kb)  |  收藏  |  浏览/下载:118/5  |  提交时间:2023/12/15
自适应动态规划  平行控制  零和博弈  
Digger: A Graph Contraction Algorithm for Patrolling Games 期刊论文
IEEE TRANSACTIONS ON RELIABILITY, 2023, 页码: 13
作者:  Han, Jinpeng;  Wang, Zhen;  Chen, Xiaoguang;  Yang, Manzhi;  Wang, Fei-Yue
收藏  |  浏览/下载:29/0  |  提交时间:2024/02/22
Games  Security  Game theory  Resource management  Runtime  Roads  Cyberspace  Graph contraction  minimum vertex cut  patrolling game  security game  Stackelberg game  
Synergetic learning for unknown nonlinear H. control using neural networks 期刊论文
NEURAL NETWORKS, 2023, 卷号: 168, 页码: 287-299
作者:  Zhu, Liao;  Guo, Ping;  Wei, Qinglai
收藏  |  浏览/下载:56/0  |  提交时间:2023/12/21
H. control  Nonlinear systems  Adaptive dynamic programming  Temporal difference  Neural network  Data-driven  
Data Generation Feedback Relearning Control for Unmodeled Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2023, 页码: 12
作者:  Zhang, Yong;  Mu, Chaoxu;  Zhao, Dongbin
收藏  |  浏览/下载:58/0  |  提交时间:2023/11/16
Data models  Real-time systems  Heuristic algorithms  Mathematical models  Adaptation models  Approximation algorithms  Cost function  Data generation model  feedback relearning control  delayed neural network  reinforcement learning  unmodeled nonlinear system  
面向工具使用的机器人技能学习方法研究 学位论文
, 2023
作者:  魏俊杭
Adobe PDF(15327Kb)  |  收藏  |  浏览/下载:143/9  |  提交时间:2023/10/25
机器人工具使用  多模态感知  自监督学习  复杂长序任务  
Cognition-Driven Multiagent Policy Learning Framework for Promoting Cooperation 期刊论文
IEEE TRANSACTIONS ON GAMES, 2023, 卷号: 15, 期号: 3, 页码: 388-398
作者:  Pu, Zhiqiang;  Wang, Huimu;  Liu, Boyin;  Yi, Jianqiang
收藏  |  浏览/下载:61/0  |  提交时间:2023/11/16
Cognition difference  coupling cognition network (CCN)  deep reinforcement learning (DRL)  graph convolutional network  multiagent systems (MASs)