CASIA OpenIR

浏览/检索结果: 共27条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Joint caching and transmission in the mobile edge network: An multi-agent learning approach 会议论文
, Madrid, Spain, 2021-12-7
作者:  Mi,Qirui;  Yang,Ning;  Zhang,Haifeng;  Zhang,Haijun;  Wang,Jun
Adobe PDF(1724Kb)  |  收藏  |  浏览/下载:20/7  |  提交时间:2024/06/05
Spiking Adaptive Dynamic Programming with Poisson Process 会议论文
, 中国山东省青岛市, 2021-07-18
作者:  Wei QL(魏庆来);  Han LY(韩立元);  Zhang TL(张铁林)
Adobe PDF(2334Kb)  |  收藏  |  浏览/下载:26/7  |  提交时间:2024/05/28
A New Constrained Cost Value Iteration for Optimal Control of Discrete-Time Nonlinear Systems 会议论文
, Beijing, China, 2021.10.22-24
作者:  Li, Tao;  Wei, Qinglai
Adobe PDF(865Kb)  |  收藏  |  浏览/下载:22/12  |  提交时间:2024/05/28
Omnidirectional Drift Control of an Underwater Biomimetic Vehicle-Manipulator System via Reinforcement Learning 会议论文
, Suzhou, China, May 14-16, 2021
作者:  Ma, Ruichen;  Wang, Yu;  Wang, Rui;  Wang, Shuo
Adobe PDF(855Kb)  |  收藏  |  浏览/下载:97/34  |  提交时间:2023/08/02
Omnidirectional Drift Control  Undulating Fin  Underwater Biomimetic Vehicle-manipulator System (UBVMS)  Reinforcement Learning  Twin Delayed Deep Deterministic policy gradient (TD3)  
AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning 会议论文
, 线上, 2022-02-22
作者:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li JQ(李金秋);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF(2593Kb)  |  收藏  |  浏览/下载:175/68  |  提交时间:2023/06/29
Learning to Play Hard Exploration Games Using Graph-guided Self-navigation 会议论文
, 线上, 2021-02
作者:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li K(李凯);  Li LJ(李丽娟);  Xing JL(兴军亮)
Adobe PDF(413Kb)  |  收藏  |  浏览/下载:156/56  |  提交时间:2023/06/28
ADEL: Autonomous Developmental Evolutionary Learning for Robotic Manipulation 会议论文
, 北京, 2021-8
作者:  Li YM(李一鸣)
Adobe PDF(9586Kb)  |  收藏  |  浏览/下载:143/20  |  提交时间:2022/06/16
A Multi-Task MRC Framework for Chinese Emotion Cause and Experiencer Extraction 会议论文
, Bratislava, Slovakia, 2021-09
作者:  Haoda Qian;  Qiudan Li;  Zaichuan Tang
Adobe PDF(79001Kb)  |  收藏  |  浏览/下载:346/124  |  提交时间:2022/06/14
A Policy-Based Reinforcement Learning Approach for High-Speed Railway Timetable Rescheduling 会议论文
, Indianapolis, IN, USA, 19-22 Sept. 2021
作者:  Yin Wang;  Yisheng Lv;  Jianying Zhou;  Zhiming Yuan;  Qi Zhang;  Min Zhou
Adobe PDF(1210Kb)  |  收藏  |  浏览/下载:179/51  |  提交时间:2022/04/08
UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 12
作者:  Chai, Jiajun;  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin;  Ma, Zhe;  Sun, Kewu;  Ding, Jishiyu
Adobe PDF(3402Kb)  |  收藏  |  浏览/下载:265/30  |  提交时间:2022/01/27
Multi-agent systems  Training  Task analysis  Reinforcement learning  Sun  Learning systems  Semantics  Centralized training with decentralized execution (CTDE)  multiagent  reinforcement learning  StarCraft II