CASIA OpenIR

浏览/检索结果: 共29条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Policy Iteration Algorithm for Constrained Cost Optimal Control of Discrete-Time Nonlinear System 会议论文
, Shenzhen, China, 2021.7.18-22
作者:  Li, Tao;  Wei, Qinglai;  Li, Hongyang;  Song, Ruizhuo
Adobe PDF(920Kb)  |  收藏  |  浏览/下载:58/24  |  提交时间:2024/05/28
AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning 会议论文
, 线上, 2022-02-22
作者:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li JQ(李金秋);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF(2593Kb)  |  收藏  |  浏览/下载:214/79  |  提交时间:2023/06/29
Learning to Play Hard Exploration Games Using Graph-guided Self-navigation 会议论文
, 线上, 2021-02
作者:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li K(李凯);  Li LJ(李丽娟);  Xing JL(兴军亮)
Adobe PDF(413Kb)  |  收藏  |  浏览/下载:186/66  |  提交时间:2023/06/28
A Collaborative Communication-Qmix Approach for Large-scale Networked Traffic Signal Control 会议论文
, Indianapolis, IN, United States, 2021-9-19
作者:  Chen, Xiaoyu;  Xiong, Gang;  Lv, Yisheng;  Chen, yuanyuan;  Song, bing;  Wang, Feiyue
Adobe PDF(1208Kb)  |  收藏  |  浏览/下载:284/74  |  提交时间:2022/06/16
Information Bottleneck Disentanglement for Identity Swapping 会议论文
, 线上, 2021.6.19
作者:  Gao, Gege;  Huang, Huaibo;  Fu, Chaoyou;  Li, Zhaoyang;  He, Ran
Adobe PDF(7781Kb)  |  收藏  |  浏览/下载:165/50  |  提交时间:2022/06/15
A Multi-Task MRC Framework for Chinese Emotion Cause and Experiencer Extraction 会议论文
, Bratislava, Slovakia, 2021-09
作者:  Haoda Qian;  Qiudan Li;  Zaichuan Tang
Adobe PDF(79001Kb)  |  收藏  |  浏览/下载:370/130  |  提交时间:2022/06/14
Robust Texture-Aware Computer-Generated Image Forensic: Benchmark and Algorithm 期刊论文
IEEE Transactions on Image Processing, 2021, 卷号: 30, 页码: 8439-8453
作者:  Bai, Weiming;  Zhang, Zhipeng;  Li, Bing;  Wang, Pei;  Li, Yangxi;  Zhang, Congxuan;  Hu, Weiming
Adobe PDF(4552Kb)  |  收藏  |  浏览/下载:222/65  |  提交时间:2022/06/14
DIMSAN: Fast Exploration with the Synergy between Density-based Intrinsic Motivation and Self-adaptive Action Noise 会议论文
, 西安, 2021.5.30-2021.6.5
作者:  Li, Jiayi;  Li, Boyao;  Lu, Tao;  Lu, Ning;  Cai, Yinghao;  Wang, Shuo
Adobe PDF(5599Kb)  |  收藏  |  浏览/下载:211/41  |  提交时间:2022/06/14
UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 12
作者:  Chai, Jiajun;  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin;  Ma, Zhe;  Sun, Kewu;  Ding, Jishiyu
Adobe PDF(3402Kb)  |  收藏  |  浏览/下载:291/38  |  提交时间:2022/01/27
Multi-agent systems  Training  Task analysis  Reinforcement learning  Sun  Learning systems  Semantics  Centralized training with decentralized execution (CTDE)  multiagent  reinforcement learning  StarCraft II  
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 13
作者:  Hu, Guangzheng;  Zhu, Yuanheng;  Zhao, Dongbin;  Zhao, Mengchen;  Hao, Jianye
Adobe PDF(4187Kb)  |  收藏  |  浏览/下载:266/12  |  提交时间:2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)