CASIA OpenIR

Browse/Search results: 17 items in total, showing items 1-10

BigyaPAn: Deep Analysis of Old Paper Advertisement (Conference paper)
Shenzhen, 2021-9
Authors:  Chandranath Adak;  Tao X(陶显)
Adobe PDF(7853Kb)  |  Views/Downloads: 8/2  |  Submitted: 2024/06/05
AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning (Conference paper)
Online, 2022-02-22
Authors:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li JQ(李金秋);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF(2593Kb)  |  Views/Downloads: 167/65  |  Submitted: 2023/06/29
Semantic Perception Swarm Policy with Deep Reinforcement Learning (Conference paper)
Online, 05 December 2021
Authors:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(523Kb)  |  Views/Downloads: 119/48  |  Submitted: 2023/06/12
ADEL: Autonomous Developmental Evolutionary Learning for Robotic Manipulation (Conference paper)
Beijing, 2021-8
Author:  Li YM(李一鸣)
Adobe PDF(9586Kb)  |  Views/Downloads: 141/19  |  Submitted: 2022/06/16
A Multi-Task MRC Framework for Chinese Emotion Cause and Experiencer Extraction (Conference paper)
Bratislava, Slovakia, 2021-09
Authors:  Haoda Qian;  Qiudan Li;  Zaichuan Tang
Adobe PDF(79001Kb)  |  Views/Downloads: 343/124  |  Submitted: 2022/06/14
Decentralized Event-Driven Constrained Control Using Adaptive Critic Designs (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 15
Authors:  Yang, Xiong;  Zhu, Yuanheng;  Dong, Na;  Wei, Qinglai
Adobe PDF(1578Kb)  |  Views/Downloads: 211/3  |  Submitted: 2022/01/27
Adaptive critic designs (ACDs)  adaptive dynamic programming (ADP)  decentralized event-driven control  input constraint  reinforcement learning (RL)  
Hierarchical Motion Learning for Goal-Oriented Movements With Speed-Accuracy Tradeoff of a Musculoskeletal System (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2021, Pages: 14
Authors:  Zhou, Junjie;  Zhong, Shanlin;  Wu, Wei
Adobe PDF(5440Kb)  |  Views/Downloads: 277/48  |  Submitted: 2022/01/27
Brain-inspired decision making  Fitts' law  Motion generation  Musculoskeletal system  Speed-accuracy tradeoff (SAT)  
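Urban Road Traffic Signal Control Based on Multi-Agent Reinforcement Learning (基于多智能体强化学习的城市道路交通信号控制) (Thesis)
Institute of Automation, Chinese Academy of Sciences: Institute of Automation, Chinese Academy of Sciences, 2021
Author:  刘皓 (Liu Hao)
Adobe PDF(4749Kb)  |  Views/Downloads: 226/4  |  Submitted: 2021/07/02
Traffic signal control  Reinforcement learning  Multi-agent  Internet of Vehicles  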
Multi-Agent Hierarchical Cognition Difference Policy for Multi-Agent Cooperation (Journal article)
Algorithms, 2021, Issue: 14, Pages: 98
Authors:  Huimu Wang;  Zhen Liu;  Jianqiang Yi;  Zhiqiang Pu
Adobe PDF(1155Kb)  |  Views/Downloads: 246/49  |  Submitted: 2021/06/24
multiagent system  deep reinforcement learning  variational autoencoder  attention mechanism  
Adaptive Critic Learning for Constrained Optimal Event-Triggered Control With Discounted Cost (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Volume: 32, Issue: 1, Pages: 91-104
Authors:  Yang, Xiong;  Wei, Qinglai
Views/Downloads: 207/0  |  Submitted: 2021/06/15
Nonlinear systems  Optimal control  Robustness  Cost function  Adaptive systems  Adaptive critic designs (ACDs)  adaptive critic learning (ACL)  adaptive dynamic programming (ADP)  constrained optimal control  event-triggered control (ETC)  reinforcement learning (RL)