CASIA OpenIR

Browse/Search results: 75 items in total, showing items 1-10

Deep Reinforcement Learning-Based Driving Policy at Intersections Utilizing Lane Graph Networks (Journal article)
IEEE Transactions on Cognitive and Developmental Systems, 2024, Pages: 1-16
Authors: Liu, Yuqi; Zhang, Qichao; Gao, Yinfeng; Zhao, Dongbin
Adobe PDF (22863Kb) | Views/Downloads: 2/1 | Submitted: 2024/06/03
Keywords: Reinforcement Learning; Autonomous Driving; Intersection Navigating
Reward Estimation with Scheduled Knowledge Distillation for Dialogue Policy Learning (Journal article)
Connection Science, 2023, Volume: 35, Issue: 1, Pages: 2174078
Authors: Qiu JY (邱俊彦); Haidong Zhang; Yiping Yang
Adobe PDF (831Kb) | Views/Downloads: 7/1 | Submitted: 2024/05/29
Keywords: reinforcement learning; dialogue policy learning; curriculum learning; knowledge distillation
Graph-guided deep hashing networks for similar patient retrieval (Journal article)
Computers in Biology and Medicine, 2024, Volume: 169, Pages: 107865
Authors: Gu, Yifan; Yang, Xuebing; Sun, Mengxuan; Wang, Chutong; Yang, Hongyu; Yang, Chao; Wang, Jinwei; Kong, Guilan; Lv, Jicheng; Zhang, Wensheng
Adobe PDF (1325Kb) | Views/Downloads: 4/1 | Submitted: 2024/05/28
Keywords: Similar patient retrieval; Deep hashing; Graph neural networks; Patient representation learning; Electronic health records
Multistep Look-Ahead Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems With Isoperimetric Constraints (Journal article)
IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2023, Volume: 54, Issue: 3, Pages: 1414-1426
Authors: Li, Tao; Wei, Qinglai; Wang, Fei-Yue
Adobe PDF (784Kb) | Views/Downloads: 62/1 | Submitted: 2024/02/22
Keywords: Performance analysis; Optimal control; Dynamic programming; Iterative algorithms; Upper bound; Measurement; Convergence; Adaptive dynamic programming (ADP); isoperimetric constraints; nonlinear systems; optimal control; policy iteration
Multitask Policy Adversarial Learning for Human-Level Control With Large State Spaces (Journal article)
IEEE Transactions on Industrial Informatics, 2019, Volume: 15, Issue: 4, Pages: 2395-2404
Authors: Wang JP (王军平); You Kang Shi; Wen Sheng Zhang; Ian Thomas; Shi Hui Duan
Adobe PDF (2547Kb) | Views/Downloads: 112/39 | Submitted: 2023/05/05
Attention Enhanced Reinforcement Learning for Multi agent Cooperation (Journal article)
IEEE Transactions on Neural Networks and Learning Systems, 2022, Pages: 15
Authors: Pu, Zhiqiang; Wang, Huimu; Liu, Zhen; Yi, Jianqiang; Wu, Shiguang
Adobe PDF (2967Kb) | Views/Downloads: 322/46 | Submitted: 2022/06/06
Keywords: Training; Reinforcement learning; Games; Scalability; Task analysis; Standards; Optimization; Attention mechanism; deep reinforcement learning (DRL); graph convolutional networks; multi agent systems
Attention enhanced reinforcement learning for multi-agent cooperation (Journal article)
IEEE Transactions on Neural Networks and Learning Systems, 2022, Issue: 2022, Pages: 1-15
Authors: Zhiqiang Pu; Huimu Wang; Zhen Liu; Jianqiang Yi; Shiguang Wu
Adobe PDF (2967Kb) | Views/Downloads: 221/41 | Submitted: 2022/04/02
Keywords: Attention mechanism; deep reinforcement learning (DRL); graph convolutional networks; multi agent systems
Self-Learning Robust Control Synthesis and Trajectory Tracking of Uncertain Dynamics (Journal article)
IEEE Transactions on Cybernetics, 2022, Volume: 52, Issue: 1, Pages: 278-286
Authors: Wang, Ding; Cheng, Long; Yan, Jun
Views/Downloads: 228/0 | Submitted: 2022/03/17
Keywords: Robust control; Optimal control; Cost function; Trajectory tracking; Nonlinear systems; Feedback control; Dynamical systems; Adaptive critic learning; control synthesis; neural networks; optimization; robust stabilization; tracking design
An Approximate Neuro-Optimal Solution of Discounted Guaranteed Cost Control Design (Journal article)
IEEE Transactions on Cybernetics, 2022, Volume: 52, Issue: 1, Pages: 77-86
Authors: Wang, Ding; Qiao, Junfei; Cheng, Long
Views/Downloads: 246/0 | Submitted: 2022/03/17
Keywords: Control design; Cost function; Optimal control; Nonlinear systems; Adaptive systems; Switches; Adaptive learning system; discount factor; guaranteed cost function; neuro-optimal control; uncertainty
Decentralized Event-Driven Constrained Control Using Adaptive Critic Designs (Journal article)
IEEE Transactions on Neural Networks and Learning Systems, 2021, Pages: 15
Authors: Yang, Xiong; Zhu, Yuanheng; Dong, Na; Wei, Qinglai
Views/Downloads: 202/0 | Submitted: 2022/01/27
Keywords: Adaptive critic designs (ACDs); adaptive dynamic programming (ADP); decentralized event-driven control; input constraint; reinforcement learning (RL)