CASIA OpenIR

Browse/Search results: 311 items in total, showing items 1-10

A Novel Parallel Control Method for Optimal Consensus of Nonlinear Multiagent Systems (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2024, Pages: 14
Authors:  Jiao, Shanshan;  Wei, Qinglai;  Wang, Fei-Yue
Views/Downloads: 1/0  |  Submitted: 2024/07/22
Consensus control  Optimal control  Performance analysis  Vectors  Heuristic algorithms  Multi-agent systems  Dynamic programming  Adaptive dynamic programming (ADP)  coupled Hamilton-Jacobi  optimal consensus control  parallel control  
Online Off-Policy Reinforcement Learning for Optimal Control of Unknown Nonlinear Systems Using Neural Networks (Journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, Pages: 11
Authors:  Zhu, Liao;  Wei, Qinglai;  Guo, Ping
Views/Downloads: 7/0  |  Submitted: 2024/07/03
Adaptive dynamic programming  nonlinear systems  online learning  optimal control  reinforcement learning (RL)  
Boosting On-Policy Actor-Critic With Shallow Updates in Critic (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, Pages: 10
Authors:  Li, Luntong;  Zhu, Yuanheng
Views/Downloads: 8/0  |  Submitted: 2024/07/03
Artificial neural networks  Vectors  Task analysis  Training  Representation learning  Approximation algorithms  Optimization  Actor-critic  deep reinforcement learning (DRL)  proximal policy optimization (PPO)  shallow reinforcement learning (SRL)  
Online Adaptive Dynamic Programming for Optimal Self-Learning Control of VTOL Aircraft Systems With Disturbances (Journal article)
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, Volume: 21, Issue: 1, Pages: 343-352
Authors:  Wei, Qinglai;  Yang, Zesheng;  Su, Huaizhong;  Wang, Lijian
Views/Downloads: 7/0  |  Submitted: 2024/07/03
Adaptive dynamic programming (ADP)  VTOL aircraft system  policy iteration  neural network (NN)  optimal control  iterative errors  
Synergetic Learning Neuro-Control for Unknown Affine Nonlinear Systems With Asymptotic Stability Guarantees (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, Pages: 11
Authors:  Zhu, Liao;  Wei, Qinglai;  Guo, Ping
Views/Downloads: 9/0  |  Submitted: 2024/07/03
Approximate dynamic programming (ADP)  neural network  off-policy  optimal control  reinforcement learning (RL)  
User Response Modeling in Reinforcement Learning for Ads Allocation (Conference paper)
Singapore, May 13 - 17, 2024
Authors:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, Xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  Views/Downloads: 35/15  |  Submitted: 2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
Modeling Socially Normative Navigation Behaviors from Demonstrations with Inverse Reinforcement Learning (Conference paper)
Vancouver, British Columbia, Canada, 2019-08-22 to 2019-08-26
Authors:  Xingyuan Gao;  Xiaoguang Zhao;  Min Tan
Adobe PDF(1500Kb)  |  Views/Downloads: 32/14  |  Submitted: 2024/06/21
Self-Modifying State Modeling for Simultaneous Machine Translation (Conference paper)
Bangkok, Thailand, August 11–16, 2024
Authors:  Donglei, Yu;  Xiaomian, Kang;  Yuchen, Liu;  YU, Zhou;  Chengqing, Zong
Adobe PDF(924Kb)  |  Views/Downloads: 26/13  |  Submitted: 2024/06/20
Learning to Correct Erroneous Words for Document Grounded Conversations (Conference paper)
Kuantan, Malaysia, 2023.02.23-2023.02.25
Authors:  Junyan Qiu;  Haitao Wang;  Yiping Yang
Adobe PDF(773Kb)  |  Views/Downloads: 43/18  |  Submitted: 2024/06/17
Deep Learning  Natural Language Generation  Dialogue System  Curriculum Learning  
Power Control Based on Deep Reinforcement Learning for Spectrum Sharing (Journal article)
IEEE Transactions on Wireless Communications, 2024, Volume: 19, Issue: 6, Pages: 4209-4219
Authors:  Zhang, Haijun;  Yang, Ning;  Huangfu, Wei;  Long, Keping;  Leung, Victor C. M.
Adobe PDF(1925Kb)  |  Views/Downloads: 43/17  |  Submitted: 2024/06/12