CASIA OpenIR

Browse/Search Results: 119 items in total, showing items 1-10

Online Off-Policy Reinforcement Learning for Optimal Control of Unknown Nonlinear Systems Using Neural Networks (Journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, Pages: 11
Authors:  Zhu, Liao;  Wei, Qinglai;  Guo, Ping
Views/Downloads: 7/0  |  Submitted: 2024/07/03
Adaptive dynamic programming  nonlinear systems  online learning  optimal control  reinforcement learning (RL)  
Online Adaptive Dynamic Programming for Optimal Self-Learning Control of VTOL Aircraft Systems With Disturbances (Journal article)
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, Volume: 21, Issue: 1, Pages: 343-352
Authors:  Wei, Qinglai;  Yang, Zesheng;  Su, Huaizhong;  Wang, Lijian
Views/Downloads: 7/0  |  Submitted: 2024/07/03
Adaptive dynamic programming (ADP)  VTOL aircraft system  policy iteration  neural network (NN)  optimal control  iterative errors  
Synergetic Learning Neuro-Control for Unknown Affine Nonlinear Systems With Asymptotic Stability Guarantees (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, Pages: 11
Authors:  Zhu, Liao;  Wei, Qinglai;  Guo, Ping
Views/Downloads: 8/0  |  Submitted: 2024/07/03
Approximate dynamic programming (ADP)  neural network  off-policy  optimal control  reinforcement learning (RL)  
Learn to flap: foil non-parametric path planning via deep reinforcement learning (Journal article)
Journal of Fluid Mechanics, 2024, Volume: 984, Pages: A9
Authors:  Wang, Zhipeng;  Lin, Runji;  Zhao, Zhiyu;  Chen, Xu;  Guo, Pengming;  Yang, Ning;  Wang, Zhicheng;  Fan, Dixia
Adobe PDF(1892Kb)  |  Views/Downloads: 49/11  |  Submitted: 2024/06/07
Enhance Sample Efficiency and Robustness of End-to-end Urban Autonomous Driving via Semantic Masked World Model (Journal article)
IEEE Transactions on Intelligent Transportation Systems, 2024, Pages: 10.1109/TITS.2024.3400227
Authors:  Zeyu Gao;  Yao Mu;  Chen Chen;  Jingliang Duan;  Ping Luo;  Yanfeng Lu;  Shengbo Eben Li
Adobe PDF(3954Kb)  |  Views/Downloads: 40/15  |  Submitted: 2024/06/06
End-to-end autonomous driving  deep reinforcement learning  world model  
Reward Estimation with Scheduled Knowledge Distillation for Dialogue Policy Learning (Journal article)
Connection Science, 2023, Volume: 35, Issue: 1, Pages: 2174078
Authors:  Qiu JY (邱俊彦);  Haidong Zhang;  Yiping Yang
Adobe PDF(831Kb)  |  Views/Downloads: 52/19  |  Submitted: 2024/05/29
reinforcement learning  dialogue policy learning  curriculum learning  knowledge distillation  
Computational Experiments for Complex Social Systems: Experiment Design and Generative Explanation (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 4, Pages: 1022-1038
Authors:  Xiao Xue;  Deyu Zhou;  Xiangning Yu;  Gang Wang;  Juanjuan Li;  Xia Xie;  Lizhen Cui;  Fei-Yue Wang
Adobe PDF(7239Kb)  |  Views/Downloads: 69/17  |  Submitted: 2024/03/18
Agent-based modeling  computational experiments  cyber-physical-social systems (CPSS)  generative deduction  generative experiments  meta model  
Multistep Look-Ahead Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems With Isoperimetric Constraints (Journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, Volume: 54, Issue: 3, Pages: 1414-1426
Authors:  Li, Tao;  Wei, Qinglai;  Wang, Fei-Yue
Adobe PDF(784Kb)  |  Views/Downloads: 119/21  |  Submitted: 2024/02/22
Performance analysis  Optimal control  Dynamic programming  Iterative algorithms  Upper bound  Measurement  Convergence  Adaptive dynamic programming (ADP)  isoperimetric constraints  nonlinear systems  optimal control  policy iteration  
Advancements in Humanoid Robots: A Comprehensive Review and Future Prospects (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 2, Pages: 301-328
Authors:  Yuchuang Tong;  Haotian Liu;  Zhengtao Zhang
Adobe PDF(7587Kb)  |  Views/Downloads: 164/45  |  Submitted: 2024/01/23
Future trends and challenges  humanoid robots  human-robot interaction  key technologies  potential applications  
ScoreMix: A Scalable Augmentation Strategy for Training GANs With Limited Data (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 7, Pages: 8920-8935
Authors:  Cao, Jie;  Luo, Mandi;  Yu, Junchi;  Yang, Ming-Hsuan;  He, Ran
Adobe PDF(1823Kb)  |  Views/Downloads: 131/9  |  Submitted: 2023/11/17
Generative adversarial networks  image synthesis  data augmentation  few-shot image-to-image translation