CASIA OpenIR

浏览/检索结果: 共64条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game 期刊论文
IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, 页码: 1-13
作者:  Guangzheng Hu;  Yuanheng Zhu;  Haoran Li;  Dongbin Zhao
Adobe PDF(2144Kb)  |  收藏  |  浏览/下载:39/7  |  提交时间:2024/06/05
Games  Q-learning  Task analysis  Optimization  Convergence  Training  Nash equilibrium  Multi-agent reinforcement learning  minimax-Q learning  two-team zero-sum Markov games  
Tri-relational multi-faceted graph neural networks for automatic question tagging 期刊论文
Neurocomputing, 2024, 卷号: 576, 页码: 127250
作者:  Nuojia Xu;  Jun Hu;  Quan Fang;  Dizhan Xue;  Yongxi Li;  Shengsheng Qian
Adobe PDF(2105Kb)  |  收藏  |  浏览/下载:43/20  |  提交时间:2024/06/04
Graph Neural Networks  Community Question Answering  Question Tagging  
Deep Reinforcement Learning-Based Driving Policy at Intersections Utilizing Lane Graph Networks 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 1 - 16
作者:  Liu, Yuqi;  Zhang, Qichao;  Gao, Yinfeng;  Zhao, Dongbin
Adobe PDF(22863Kb)  |  收藏  |  浏览/下载:41/14  |  提交时间:2024/06/03
Reinforcement Learning  Autonomous Driving  Intersection Navigating  
Graph-guided deep hashing networks for similar patient retrieval 期刊论文
Computers in Biology and Medicine, 2024, 卷号: 169, 页码: 107865
作者:  Gu, Yifan;  Yang, Xuebing;  Sun, Mengxuan;  Wang, Chutong;  Yang, Hongyu;  Yang, Chao;  Wang, Jinwei;  Kong, Guilan;  Lv, Jicheng;  Zhang, Wensheng
Adobe PDF(1325Kb)  |  收藏  |  浏览/下载:45/18  |  提交时间:2024/05/28
Similar patient retrieval  Deep hashing  Graph neural networks  Patient representation learning  Electronic health records  
Multistep Look-Ahead Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems With Isoperimetric Constraints 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 卷号: 54, 期号: 3, 页码: 1414-1426
作者:  Li, Tao;  Wei, Qinglai;  Wang, Fei-Yue
Adobe PDF(784Kb)  |  收藏  |  浏览/下载:116/21  |  提交时间:2024/02/22
Performance analysis  Optimal control  Dynamic programming  Iterative algorithms  Upper bound  Measurement  Convergence  Adaptive dynamic programming (ADP)  isoperimetric constraints  nonlinear systems  optimal control  policy iteration  
Multitask Policy Adversarial Learning for Human-Level Control With Large State Spaces 期刊论文
IEEE Transactions on Industrial Informatics, 2019, 卷号: 15, 期号: 4, 页码: 2395-2404
作者:  Wang JP(王军平);  You Kang Shi;  Wen Sheng Zhang;  Ian Thomas;  Shi Hui Duan
Adobe PDF(2547Kb)  |  收藏  |  浏览/下载:150/48  |  提交时间:2023/05/05
Decentralized Event-Driven Constrained Control Using Adaptive Critic Designs 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 15
作者:  Yang, Xiong;  Zhu, Yuanheng;  Dong, Na;  Wei, Qinglai
Adobe PDF(1578Kb)  |  收藏  |  浏览/下载:237/14  |  提交时间:2022/01/27
Adaptive critic designs (ACDs)  adaptive dynamic programming (ADP)  decentralized event-driven control  input constraint  reinforcement learning (RL)  
Hierarchical Motion Learning for Goal-Oriented Movements With Speed-Accuracy Tradeoff of a Musculoskeletal System 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2021, 页码: 14
作者:  Zhou, Junjie;  Zhong, Shanlin;  Wu, Wei
Adobe PDF(5440Kb)  |  收藏  |  浏览/下载:304/52  |  提交时间:2022/01/27
Brain-inspired decision making  Fitts' law  Motion generation  Musculoskeletal system  Speed-accuracy tradeoff (SAT)  
SADRL: Merging human experience with machine intelligence via supervised assisted deep reinforcement learning 期刊论文
NEUROCOMPUTING, 2022, 卷号: 467, 页码: 300-309
作者:  Li, Xiaoshuang;  Wang, Xiao;  Zheng, Xinhu;  Jin, Junchen;  Huang, Yanhao;  Zhang, Jun Jason;  Wang, Fei-Yue
Adobe PDF(1244Kb)  |  收藏  |  浏览/下载:341/76  |  提交时间:2021/12/28
Deep reinforcement learning  Behavioral cloning  Dynamic demonstration  Double DQN  
Adaptive Critic Learning for Constrained Optimal Event-Triggered Control With Discounted Cost 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 卷号: 32, 期号: 1, 页码: 91-104
作者:  Yang, Xiong;  Wei, Qinglai
收藏  |  浏览/下载:246/0  |  提交时间:2021/06/15
Nonlinear systems  Optimal control  Robustness  Cost function  Adaptive systems  Adaptive critic designs (ACDs)  adaptive critic learning (ACL)  adaptive dynamic programming (ADP)  constrained optimal control  event-triggered control (ETC)  reinforcement learning (RL)